A comprehensive study of speech separation: spectrogram vs waveform separation

17 May 2019
F. Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
Dong Yu

Papers citing "A comprehensive study of speech separation: spectrogram vs waveform separation"

40 / 40 papers shown
Attention-Based Beamformer For Multi-Channel Speech Enhancement
Jinglin Bai
Hao Li
Xueliang Zhang
Fei Chen
49
0
0
10 Sep 2024
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
Guinan Li
Jiajun Deng
Youjun Chen
Mengzhe Geng
Shujie Hu
...
Zengrui Jin
Tianzi Wang
Xurong Xie
Helen Meng
Xunying Liu
VLM
56
0
0
14 Jun 2024
HPCNeuroNet: Advancing Neuromorphic Audio Signal Processing with Transformer-Enhanced Spiking Neural Networks
Murat Isik
Hiruna Vishwamith
Kayode Inadagbo
I. C. Dikmen
67
6
0
21 Nov 2023
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Guinan Li
Jiajun Deng
Mengzhe Geng
Zengrui Jin
Tianzi Wang
Shujie Hu
Mingyu Cui
Helen M. Meng
Xunying Liu
60
12
0
06 Jul 2023
The Intel Neuromorphic DNS Challenge
Jonathan Timcheck
S. Shrestha
D. B. Rubin
A. Kupryjanow
Garrick Orchard
Lukasz Pindor
Timothy M. Shea
Mike Davies
65
28
0
16 Mar 2023
Multi-Channel Masking with Learnable Filterbank for Sound Source Separation
Wang Dai
Archontis Politis
Tuomas Virtanen
50
0
0
14 Mar 2023
VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting
Aoting Zhang
He Wang
Pengcheng Guo
Yihui Fu
Linfu Xie
Yingying Gao
Shilei Zhang
Junlan Feng
71
5
0
27 Feb 2023
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Rongzhi Gu
Shi-Xiong Zhang
Yuexian Zou
Dong Yu
AI4TS
88
25
0
16 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
117
22
0
01 Dec 2022
UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio Separation
Kashyap Patel
A. Kovalyov
Issa Panahi
50
6
0
28 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
105
19
0
05 Oct 2022
Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
S. Araki
33
20
0
07 May 2022
Audio-visual multi-channel speech separation, dereverberation and recognition
Guinan Li
Jianwei Yu
Jiajun Deng
Xunying Liu
Helen Meng
73
7
0
05 Apr 2022
Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition
Junhao Xu
Jianwei Yu
Xunying Liu
Helen Meng
MQ
48
10
0
29 Nov 2021
Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithms
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
H. Sawada
Naoyuki Kamo
S. Araki
69
8
0
20 Nov 2021
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
H. Sawada
S. Araki
60
19
0
04 Aug 2021
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Quandong Wang
Junnan Wu
Zhao Yan
Sichong Qian
Liyong Guo
Lichun Fan
Weiji Zhuang
Peng Gao
Yujun Wang
68
0
0
23 Jul 2021
A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation
Martin Strauss
Jouni Paulus
Matteo Torcoli
B. Edler
51
9
0
16 Jun 2021
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Meng Yu
Chunlei Zhang
Yong-mei Xu
Shi-Xiong Zhang
Dong Yu
55
31
0
02 Apr 2021
Continuous Speech Separation with Ad Hoc Microphone Arrays
Dongmei Wang
Takuya Yoshioka
Zhuo Chen
Xiaofei Wang
Tianyan Zhou
Zhong Meng
43
27
0
03 Mar 2021
Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
115
80
0
16 Feb 2021
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Z. Zhang
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Lianwu Chen
Donald Williamson
Dong Yu
38
29
0
24 Dec 2020
On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
76
46
0
11 Nov 2020
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment
Hangting Chen
Pengyuan Zhang
32
6
0
01 Jul 2020
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Thilo von Neumann
Christoph Boeddeker
Lukas Drude
K. Kinoshita
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
81
41
0
04 Jun 2020
End-to-End Multi-Look Keyword Spotting
Meng Yu
Xuan Ji
Bo Wu
Dan Su
Dong Yu
52
19
0
20 May 2020
Jointly optimal denoising, dereverberation, and source separation
Tomohiro Nakatani
Christoph Boeddeker
K. Kinoshita
Rintaro Ikeshita
Marc Delcroix
Reinhold Haeb-Umbach
47
46
0
20 May 2020
Audio-visual Multi-channel Recognition of Overlapped Speech
Jianwei Yu
Bo Wu
R. Yu
Shi-Xiong Zhang
Lianwu Chen
Yong Xu
Meng Yu
Dan Su
Dong Yu
Xunying Liu
Helen Meng
98
19
0
18 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
94
157
0
08 May 2020
Neural Spatio-Temporal Beamformer for Target Speech Separation
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Lianwu Chen
Chao Weng
Jianming Liu
Dong Yu
82
41
0
08 May 2020
Neural Speech Separation Using Spatially Distributed Microphones
Dongmei Wang
Zhuo Chen
Takuya Yoshioka
53
39
0
28 Apr 2020
An empirical study of Conv-TasNet
Berkan Kadıoğlu
Michael Horgan
Xiaoyu Liu
Jordi Pons
Dan Darcy
Vivek Kumar
42
44
0
20 Feb 2020
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss
Rui Liu
Berrak Sisman
F. Bao
Guanglai Gao
Haizhou Li
122
14
0
02 Feb 2020
Continuous speech separation: dataset and analysis
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
109
217
0
30 Jan 2020
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Marc Delcroix
Tsubasa Ochiai
Kateřina Žmolíková
K. Kinoshita
Naohiro Tawara
Tomohiro Nakatani
S. Araki
129
124
0
23 Jan 2020
End-to-end training of time domain audio separation and recognition
Thilo von Neumann
K. Kinoshita
Lukas Drude
Christoph Boeddeker
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
76
34
0
18 Dec 2019
A Unified Framework for Speech Separation
F. Bahmaninezhad
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
John H. L. Hansen
Dong Yu
38
4
0
17 Dec 2019
Demystifying TasNet: A Dissecting Approach
Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb-Umbach
63
58
0
20 Nov 2019
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
127
776
0
14 Oct 2019
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
AAML
117
131
0
03 Sep 2019