Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.02508
Cited By
SDR - half-baked or well done?
6 November 2018
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SDR - half-baked or well done?"
50 / 614 papers shown
Title
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
29
41
0
30 Jun 2021
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders
Xiaoyu Bie
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
DiffM
38
54
0
23 Jun 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
19
142
0
22 Jun 2021
Improving On-Screen Sound Separation for Open-Domain Videos with Audio-Visual Self-Attention
Efthymios Tzinis
Scott Wisdom
Tal Remez
J. Hershey
VLM
32
8
0
17 Jun 2021
A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation
Martin Strauss
Jouni Paulus
Matteo Torcoli
B. Edler
31
8
0
16 Jun 2021
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
23
22
0
15 Jun 2021
Few-shot learning of new sound classes for target sound extraction
Marc Delcroix
Jorge Bennasar Vázquez
Tsubasa Ochiai
K. Kinoshita
S. Araki
VLM
29
11
0
14 Jun 2021
Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex
Keitaro Tanaka
Ryosuke Sawata
Shusuke Takahashi
27
0
0
04 Jun 2021
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation
Scott Wisdom
A. Jansen
Ron J. Weiss
Hakan Erdogan
J. Hershey
43
26
0
01 Jun 2021
Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement
Guillaume Carbajal
Julius Richter
Timo Gerkmann
DRL
20
15
0
19 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
28
23
0
17 May 2021
Move2Hear: Active Audio-Visual Source Separation
Sagnik Majumder
Ziad Al-Halah
Kristen Grauman
21
44
0
15 May 2021
Separate but Together: Unsupervised Federated Learning for Speech Enhancement from Non-IID Data
Efthymios Tzinis
Jonah Casebeer
Zhepei Wang
Paris Smaragdis
FedML
32
19
0
11 May 2021
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation
Sunwoo Kim
Minje Kim
36
19
0
08 May 2021
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Aswin Sivaraman
Minje Kim
21
9
0
08 May 2021
Weakly Supervised Source-Specific Sound Level Estimation in Noisy Soundscapes
Aurora Cramer
M. Cartwright
Fatemeh Pishdadian
J. P. Bello
23
2
0
06 May 2021
AvaTr: One-Shot Speaker Extraction with Transformers
S. Hu
Md Rifat Arefin
V. Nguyen
Alish Dipani
Xaq Pitkow
A. Tolias
38
4
0
03 May 2021
Complex Neural Spatial Filter: Enhancing Multi-channel Target Speech Separation in Complex Domain
Rongzhi Gu
Shi-Xiong Zhang
Yuexian Zou
Dong Yu
41
34
0
26 Apr 2021
Improving Neural Silent Speech Interface Models by Adversarial Training
Amin Honarmandi Shandiz
L. Tóth
G. Gosztolya
Alexandra Markó
Tamás Gábor Csapó
AAML
GAN
24
7
0
23 Apr 2021
Reconstructing Speech from Real-Time Articulatory MRI Using Neural Vocoders
Yicong Yu
Amin Honarmandi Shandiz
L. Tóth
22
18
0
23 Apr 2021
Nonlinear Spatial Filtering in Multichannel Speech Enhancement
Kristina Tesch
Timo Gerkmann
19
19
0
22 Apr 2021
Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
Yueyue Na
Ziteng Wang
Zhang Liu
Biao Tian
Q. Fu
30
3
0
09 Apr 2021
Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
Aswin Sivaraman
Sunwoo Kim
Minje Kim
19
23
0
05 Apr 2021
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Meng Yu
Chunlei Zhang
Yong-mei Xu
Shi-Xiong Zhang
Dong Yu
12
30
0
02 Apr 2021
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech
Chenglin Xu
Wei Rao
Jibin Wu
Haizhou Li
34
32
0
30 Mar 2021
Time-domain Speech Enhancement with Generative Adversarial Learning
Feiyang Xiao
Jian Guan
Qiuqiang Kong
Wenwu Wang
GAN
21
9
0
30 Mar 2021
On TasNet for Low-Latency Single-Speaker Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
25
2
0
27 Mar 2021
Blind Speech Separation and Dereverberation using Neural Beamforming
Lukas Pfeifenberger
Franz Pernkopf
26
5
0
24 Mar 2021
Compute and memory efficient universal sound source separation
Efthymios Tzinis
Zhepei Wang
Xilin Jiang
Paris Smaragdis
26
40
0
03 Mar 2021
Reverb Conversion of Mixed Vocal Tracks Using an End-to-end Convolutional Deep Neural Network
Junghyun Koo
Seungryeol Paik
Kyogu Lee
24
13
0
03 Mar 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
22
6
0
02 Mar 2021
Contrastive Separative Coding for Self-supervised Representation Learning
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
SSL
24
3
0
01 Mar 2021
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks
Ju Lin
A. Wijngaarden
Kuang-Ching Wang
M. C. Smith
27
50
0
24 Feb 2021
Variational Autoencoder for Speech Enhancement with a Noise-Aware Encoder
Hu Fang
Guillaume Carbajal
S. Wermter
Timo Gerkmann
39
59
0
17 Feb 2021
Guided Variational Autoencoder for Speech Enhancement With a Supervised Classifier
Guillaume Carbajal
Julius Richter
Timo Gerkmann
DRL
SSL
18
16
0
12 Feb 2021
Joint Dereverberation and Separation with Iterative Source Steering
Taishi Nakashima
Robin Scheibler
M. Togami
Nobutaka Ono
6
12
0
12 Feb 2021
Multichannel-based learning for audio object extraction
Daniel Arteaga
Jordi Pons
DiffM
13
3
0
11 Feb 2021
Real-time Monaural Speech Enhancement With Short-time Discrete Cosine Transform
Qinglong Li
Fei Gao
Haixing Guan
Kaichi Ma
33
24
0
09 Feb 2021
ICASSP 2021 Deep Noise Suppression Challenge: Decoupling Magnitude and Phase Optimization with a Two-Stage Deep Network
Andong Li
Wenzhe Liu
Xiaoxue Luo
C. Zheng
Xiaodong Li
26
57
0
08 Feb 2021
Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
22
13
0
07 Feb 2021
Real-time Denoising and Dereverberation with Tiny Recurrent U-Net
Hyeong-Seok Choi
Sungjin Park
Jie Hwan Lee
Hoon Heo
Dongsuk Jeon
Kyogu Lee
36
57
0
05 Feb 2021
Music source separation conditioned on 3D point clouds
Francesc Lluís
V. Chatziioannou
A. Hofmann
3DPC
24
5
0
03 Feb 2021
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Yucheng Zhao
Dacheng Yin
Chong Luo
Zhiyuan Zhao
Chuanxin Tang
Wenjun Zeng
Zhengjun Zha
SSL
11
6
0
03 Feb 2021
Towards efficient models for real-time deep noise suppression
Sebastian Braun
H. Gamper
Chandan K. A. Reddy
I. Tashev
21
104
0
22 Jan 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
43
29
0
13 Jan 2021
Neural Network-based Virtual Microphone Estimator
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
S. Araki
22
10
0
12 Jan 2021
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Z. Zhang
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Lianwu Chen
Donald Williamson
Dong Yu
22
28
0
24 Dec 2020
A Synergistic Kalman- and Deep Postfiltering Approach to Acoustic Echo Cancellation
Thomas Haubner
Mhd Modar Halimeh
Andreas Brendel
Walter Kellermann
19
15
0
16 Dec 2020
Group Communication with Context Codec for Lightweight Source Separation
Yi Luo
Cong Han
N. Mesgarani
26
20
0
14 Dec 2020
Towards speech enhancement using a variational U-Net architecture
E. J. Nustede
Jörn Anemüller
17
1
0
07 Dec 2020
Previous
1
2
3
...
10
11
12
13
9
Next