University of Leicester

Two-Stage Monaural Source Separation in Reverberant Room Environments using Deep Neural Networks

journal contribution
posted on 2019-05-15, 08:51 authored by Yang Sun, Wenwu Wang, Jonathon Chambers, Syed Mohsen Naqvi
Deep neural networks (DNNs) have been used for dereverberation and separation in the monaural source separation problem. However, the performance of current state-of-the-art methods is limited, particularly in highly reverberant room environments. In this paper, we propose a two-stage approach, realized by two DNN-based methods, to address this problem. In the first stage, the speech mixture is dereverberated with the proposed dereverberation mask (DM). In the second stage, the dereverberated speech mixture is separated with the ideal ratio mask (IRM). In the first DNN-based method, the DM is integrated with the IRM to generate an enhanced time-frequency (T-F) mask, namely the ideal enhanced mask (IEM), which serves as the training target for a single DNN. In the second DNN-based method, the DM and the IRM are predicted by two individual DNNs. The IEEE and TIMIT corpora, together with real room impulse responses and noise from the NOISEX dataset, are used to generate the speech mixtures for evaluation. The proposed methods outperform the state-of-the-art, particularly in highly reverberant room environments.
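The two-stage masking idea can be illustrated with a toy sketch. The abstract does not give the mask formulas, so everything below is an assumption made for illustration only: the DM is taken as the ratio of the direct-sound mixture magnitude to the reverberant mixture magnitude, the IRM as the magnitude ratio of one source to the dereverberated mixture, the IEM as their element-wise product, and source magnitudes are treated as additive (phase is ignored). The helper names `ratio_mask`, `apply_mask` and `add` are hypothetical, and the masks are computed from known toy signals rather than predicted by a DNN.

```python
EPS = 1e-8  # guards against division by zero in silent T-F units


def ratio_mask(target, reference):
    """Element-wise target / reference over a [frame][bin] magnitude grid."""
    return [[t / (r + EPS) for t, r in zip(t_row, r_row)]
            for t_row, r_row in zip(target, reference)]


def apply_mask(mask, spec):
    """Element-wise product of a mask and a magnitude spectrogram."""
    return [[m * x for m, x in zip(m_row, x_row)]
            for m_row, x_row in zip(mask, spec)]


def add(a, b):
    """Element-wise sum of two magnitude grids (additivity is assumed)."""
    return [[x + y for x, y in zip(a_row, b_row)]
            for a_row, b_row in zip(a, b)]


# Toy magnitudes, 2 frames x 3 frequency bins (made-up numbers).
s1 = [[1.0, 0.5, 0.0], [0.2, 0.8, 0.4]]             # direct sound, source 1
s2 = [[0.5, 0.5, 1.0], [0.6, 0.2, 0.4]]             # direct sound, source 2
direct_mix = add(s1, s2)                            # dereverberated mixture
reverb_mix = [[2.0, 1.5, 1.25], [1.0, 2.0, 1.0]]    # reverberant mixture

dm = ratio_mask(direct_mix, reverb_mix)   # stage 1 mask (assumed form)
irm = ratio_mask(s1, direct_mix)          # stage 2 mask (assumed form)
iem = apply_mask(dm, irm)                 # combined mask (assumed product)

# Two masks applied in sequence versus one combined mask.
two_stage = apply_mask(irm, apply_mask(dm, reverb_mix))
one_stage = apply_mask(iem, reverb_mix)
```

Under these assumptions, applying the DM and then the IRM gives the same estimate as applying the IEM once, which is the sense in which a single network trained on a combined target and two separate networks aim at the same output. How the paper actually defines and combines the masks should be taken from the publisher version.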

History

Citation

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2019, 27(1) , pp. 125 - 139

Author affiliation

/Organisation/COLLEGE OF SCIENCE AND ENGINEERING/Department of Engineering

Version

  • AM (Accepted Manuscript)

Published in

IEEE/ACM Transactions on Audio, Speech, and Language Processing

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

issn

1558-7916

Acceptance date

2018-10-01

Copyright date

2018

Available date

2019-05-15

Publisher version

https://ieeexplore.ieee.org/abstract/document/8494775

Language

en
