Overlapped Speech Separation System (OSSS) / NC Eemaan Aziz, NC Hashir Rizwan, NC Ayesha Riaz, NC Usman Awan.

By: Aziz, EemaanContributor(s): Supervisor Dr. Shibli NisarMaterial type: TextTextPublisher: MCS, NUST Rawalpindi 2024Description: 84 pSubject(s): UG EE Project | BEE-57DDC classification: 621.382,AZI
Contents:
Overlapped Speech refers to speakers speaking simultaneously (i.e. speech mixture). Speech separation has long been an active research topic in the signal processing community, with its importance in a wide range of applications such as hearable devices and telecommunication systems. It is a fundamental problem for all higher-level speech processing tasks. With recent progress in deep neural networks, the separation performance has been significantly advanced by various new problems. The problem formulation of time-domain, end-to-end speech separation naturally arises to tackle the disadvantages in frequency-domain systems. We’ve used a dual path recurrent neural network for separation of mixed audios. DPRNN (Dual-Path RNN) primarily separates in the time domain for audio source separation. It leverages recurrent neural networks (RNNs) to process temporal sequences of audio data. DPRNN focuses on exploiting temporal dependencies within audio signals for effective separation. We looked into the training objectives for separating and improving the robustness under reverberant environments. This project is further analyzed and can be used as the basis for future works.
Tags from this library: No tags from this library for this title. Log in to add tags.
Item type Current location Home library Shelving location Call number Status Date due Barcode Item holds
Project Report Project Report Military College of Signals (MCS)
Military College of Signals (MCS)
General Stacks 621.382,AZI (Browse shelf) Available MCSPTC-478
Total holds: 0

Overlapped Speech refers to speakers speaking simultaneously (i.e. speech mixture). Speech separation has long been an active research topic in the signal processing community, with its importance in a wide range of applications such as hearable devices and telecommunication systems. It is a fundamental problem for all higher-level speech processing tasks.
With recent progress in deep neural networks, the separation performance has been significantly advanced by various new problems. The problem formulation of time-domain, end-to-end speech separation naturally arises to tackle the disadvantages in frequency-domain systems. We’ve used a dual path recurrent neural network for separation of mixed audios.
DPRNN (Dual-Path RNN) primarily separates in the time domain for audio source separation. It leverages recurrent neural networks (RNNs) to process temporal sequences of audio data. DPRNN focuses on exploiting temporal dependencies within audio signals for effective separation. We looked into the training objectives for separating and improving the robustness under reverberant environments. This project is further analyzed and can be used as the basis for future works.

There are no comments on this title.

to post a comment.
© 2023 Central Library, National University of Sciences and Technology. All Rights Reserved.