Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.07497
Cited By
v1
v2 (latest)
A comprehensive study of speech separation: spectrogram vs waveform separation
17 May 2019
F. Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
Dong Yu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A comprehensive study of speech separation: spectrogram vs waveform separation"
40 / 40 papers shown
Title
Attention-Based Beamformer For Multi-Channel Speech Enhancement
Jinglin Bai
Hao Li
Xueliang Zhang
Fei Chen
49
0
0
10 Sep 2024
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
Guinan Li
Jiajun Deng
Youjun Chen
Mengzhe Geng
Shujie Hu
...
Zengrui Jin
Tianzi Wang
Xurong Xie
Helen Meng
Xunying Liu
VLM
56
0
0
14 Jun 2024
HPCNeuroNet: Advancing Neuromorphic Audio Signal Processing with Transformer-Enhanced Spiking Neural Networks
Murat Isik
Hiruna Vishwamith
Kayode Inadagbo
I. C. Dikmen
67
6
0
21 Nov 2023
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Guinan Li
Jiajun Deng
Mengzhe Geng
Zengrui Jin
Tianzi Wang
Shujie Hu
Mingyu Cui
Helen M. Meng
Xunying Liu
60
12
0
06 Jul 2023
The Intel Neuromorphic DNS Challenge
Jonathan Timcheck
S. Shrestha
D. B. Rubin
A. Kupryjanow
Garrick Orchard
Lukasz Pindor
Timothy M. Shea
Mike Davies
65
28
0
16 Mar 2023
Multi-Channel Masking with Learnable Filterbank for Sound Source Separation
Wang Dai
Archontis Politis
Tuomas Virtanen
50
0
0
14 Mar 2023
VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting
Aoting Zhang
He Wang
Pengcheng Guo
Yihui Fu
Linfu Xie
Yingying Gao
Shilei Zhang
Junlan Feng
71
5
0
27 Feb 2023
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Rongzhi Gu
Shi-Xiong Zhang
Yuexian Zou
Dong Yu
AI4TS
88
25
0
16 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
117
22
0
01 Dec 2022
UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio Separation
Kashyap Patel
A. Kovalyov
Issa Panahi
50
6
0
28 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
105
19
0
05 Oct 2022
Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
S. Araki
33
20
0
07 May 2022
Audio-visual multi-channel speech separation, dereverberation and recognition
Guinan Li
Jianwei Yu
Jiajun Deng
Xunying Liu
Helen Meng
73
7
0
05 Apr 2022
Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition
Junhao Xu
Jianwei Yu
Xunying Liu
Helen Meng
MQ
48
10
0
29 Nov 2021
Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithms
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
H. Sawada
Naoyuki Kamo
S. Araki
69
8
0
20 Nov 2021
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
H. Sawada
S. Araki
60
19
0
04 Aug 2021
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Quandong Wang
Junnan Wu
Zhao Yan
Sichong Qian
Liyong Guo
Lichun Fan
Weiji Zhuang
Peng Gao
Yujun Wang
68
0
0
23 Jul 2021
A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation
Martin Strauss
Jouni Paulus
Matteo Torcoli
B. Edler
51
9
0
16 Jun 2021
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Meng Yu
Chunlei Zhang
Yong-mei Xu
Shi-Xiong Zhang
Dong Yu
55
31
0
02 Apr 2021
Continuous Speech Separation with Ad Hoc Microphone Arrays
Dongmei Wang
Takuya Yoshioka
Zhuo Chen
Xiaofei Wang
Tianyan Zhou
Zhong Meng
43
27
0
03 Mar 2021
Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
115
80
0
16 Feb 2021
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Z. Zhang
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Lianwu Chen
Donald Williamson
Dong Yu
38
29
0
24 Dec 2020
On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
76
46
0
11 Nov 2020
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment
Hangting Chen
Pengyuan Zhang
32
6
0
01 Jul 2020
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Thilo von Neumann
Christoph Boeddeker
Lukas Drude
K. Kinoshita
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
81
41
0
04 Jun 2020
End-to-End Multi-Look Keyword Spotting
Meng Yu
Xuan Ji
Bo Wu
Dan Su
Dong Yu
52
19
0
20 May 2020
Jointly optimal denoising, dereverberation, and source separation
Tomohiro Nakatani
Christoph Boeddeker
K. Kinoshita
Rintaro Ikeshita
Marc Delcroix
Reinhold Haeb-Umbach
47
46
0
20 May 2020
Audio-visual Multi-channel Recognition of Overlapped Speech
Jianwei Yu
Bo Wu
R. Yu
Shi-Xiong Zhang
Lianwu Chen
Yong Xu. Meng Yu
Dan Su
Dong Yu
Xunying Liu
Helen Meng
98
19
0
18 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
94
157
0
08 May 2020
Neural Spatio-Temporal Beamformer for Target Speech Separation
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Lianwu Chen
Chao Weng
Jianming Liu
Dong Yu
82
41
0
08 May 2020
Neural Speech Separation Using Spatially Distributed Microphones
Dongmei Wang
Zhuo Chen
Takuya Yoshioka
53
39
0
28 Apr 2020
An empirical study of Conv-TasNet
Berkan Kadıoğlu
Michael Horgan
Xiaoyu Liu
Jordi Pons
Dan Darcy
Vivek Kumar
42
44
0
20 Feb 2020
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss
Rui Liu
Berrak Sisman
F. Bao
Guanglai Gao
Haizhou Li
122
14
0
02 Feb 2020
Continuous speech separation: dataset and analysis
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
109
217
0
30 Jan 2020
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Marc Delcroix
Tsubasa Ochiai
Kateřina Žmolíková
K. Kinoshita
Naohiro Tawara
Tomohiro Nakatani
S. Araki
129
124
0
23 Jan 2020
End-to-end training of time domain audio separation and recognition
Thilo von Neumann
K. Kinoshita
Lukas Drude
Christoph Boeddeker
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
76
34
0
18 Dec 2019
A Unified Framework for Speech Separation
F. Bahmaninezhad
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
John H. L. Hansen
Dong Yu
38
4
0
17 Dec 2019
Demystifying TasNet: A Dissecting Approach
Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb-Umbach
63
58
0
20 Nov 2019
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
127
776
0
14 Oct 2019
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
AAML
117
131
0
03 Sep 2019
1