Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04826
Cited By
v1
v2
v3
v4
v5
v6 (latest)
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
11 October 2018
Quan Wang
Hannah Muckenhirn
K. Wilson
Prashant Sridhar
Zelin Wu
J. Hershey
Rif A. Saurous
Ron J. Weiss
Ye Jia
Ignacio López Moreno
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking"
43 / 193 papers shown
Title
Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Aswin Sivaraman
Minje Kim
SSL
73
11
0
06 Nov 2020
Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain
Shulin He
Hao Li
Xueliang Zhang
31
3
0
25 Oct 2020
Speaker Separation Using Speaker Inventories and Estimated Speech
Peidong Wang
Zhuo Chen
DeLiang Wang
Jinyu Li
Jiawei Liu
93
11
0
20 Oct 2020
Muse: Multi-modal target speaker extraction with visual cues
Zexu Pan
Ruijie Tao
Chenglin Xu
Haizhou Li
53
50
0
15 Oct 2020
Gender domain adaptation for automatic speech recognition task
Sokolov Artem
Andrey V. Savchenko
36
0
0
08 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
58
90
0
04 Oct 2020
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Quan Wang
Ignacio López Moreno
Mert Saglam
K. Wilson
Alan Chiao
...
Yanzhang He
Wei Li
Jason W. Pelecanos
M. Nika
A. Gruenstein
VLM
71
86
0
09 Sep 2020
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations
Wei Xia
John H. L. Hansen
57
9
0
02 Sep 2020
End-to-End Trainable Self-Attentive Shallow Network for Text-Independent Speaker Verification
Hyeonmook Park
Jungbae Park
Sang Wan Lee
20
0
0
14 Aug 2020
Textual Echo Cancellation
Shaojin Ding
Ye Jia
Ke Hu
Quan Wang
70
8
0
13 Aug 2020
MIRNet: Learning multiple identities representations in overlapped speech
Hyewon Han
Soo-Whan Chung
Hong-Goo Kang
69
8
0
04 Aug 2020
Version Control of Speaker Recognition Systems
Quan Wang
Ignacio López Moreno
75
9
0
23 Jul 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
70
21
0
25 Jun 2020
SE-MelGAN -- Speaker Agnostic Rapid Speech Enhancement
Luka Chkhetiani
Levan Bejanidze
44
1
0
13 Jun 2020
Similarity-and-Independence-Aware Beamformer: Method for Target Source Extraction using Magnitude Spectrogram as Reference
Atsuo Hiroe
59
3
0
01 Jun 2020
Active Speakers in Context
Juan Carlos León Alcázar
Fabian Caba Heilbron
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbelaez
Guohao Li
72
62
0
20 May 2020
Atss-Net: Target Speaker Separation via Attention-based Neural Network
Tingle Li
Qingjian Lin
Yuanyuan Bao
Ming Li
39
38
0
19 May 2020
A Thousand Words are Worth More Than One Recording: NLP Based Speaker Change Point Detection
O. H. Anidjar
Chen Hajaj
A. Dvir
I. Gilad
14
1
0
18 May 2020
Multimodal Target Speech Separation with Voice and Face References
Leyuan Qu
C. Weber
S. Wermter
CVBM
63
19
0
17 May 2020
Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
Ivan Medennikov
M. Korenevsky
Tatiana Prisyach
Yuri Y. Khokhlov
Mariya Korenevskaya
...
Anton Mitrofanov
A. Andrusenko
Ivan Podluzhny
A. Laptev
A. Romanenko
63
205
0
14 May 2020
FaceFilter: Audio-visual speech separation using still images
Soo-Whan Chung
Soyeon Choe
Joon Son Chung
Hong-Goo Kang
CVBM
116
67
0
14 May 2020
SpEx+: A Complete Time Domain Speaker Extraction Network
Meng Ge
Chenglin Xu
Longbiao Wang
Chng Eng Siong
Jianwu Dang
Haizhou Li
79
149
0
10 May 2020
SpEx: Multi-Scale Time Domain Speaker Extraction Network
Chenglin Xu
Wei Rao
Eng Siong Chng
Haizhou Li
64
173
0
17 Apr 2020
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
101
175
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
112
265
0
20 Feb 2020
Continuous speech separation: dataset and analysis
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
109
217
0
30 Jan 2020
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Marc Delcroix
Tsubasa Ochiai
Kateřina Žmolíková
K. Kinoshita
Naohiro Tawara
Tomohiro Nakatani
S. Araki
129
124
0
23 Jan 2020
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation
Rongzhi Gu
Yuexian Zou
60
18
0
02 Jan 2020
Audio-Visual Target Speaker Enhancement on Multi-Talker Environment using Event-Driven Cameras
A. Arriandiaga
Giovanni Morrone
Luca Pasa
Leonardo Badino
Chiara Bartolozzi
40
1
0
05 Dec 2019
Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement
Zhong-Qiu Wang
Hakan Erdogan
Scott Wisdom
K. Wilson
Desh Raj
Shinji Watanabe
Zhuo Chen
J. Hershey
57
1
0
18 Nov 2019
Improving Universal Sound Separation Using Sound Classification
Efthymios Tzinis
Scott Wisdom
J. Hershey
A. Jansen
D. Ellis
VLM
82
73
0
18 Nov 2019
The sound of my voice: speaker representation loss for target voice separation
Seongkyu Mun
Soyeon Choe
Jaesung Huh
Joon Son Chung
47
16
0
06 Nov 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Pavel Denisov
Ngoc Thang Vu
55
27
0
13 Aug 2019
Personal VAD: Speaker-Conditioned Voice Activity Detection
Shaojin Ding
Quan Wang
Shuo-yiin Chang
Li Wan
Ignacio López Moreno
74
75
0
12 Aug 2019
My lips are concealed: Audio-visual speech enhancement through obstructions
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
65
91
0
11 Jul 2019
Auditory Separation of a Conversation from Background via Attentional Gating
Shariq Mobin
Bruno A. Olshausen
32
2
0
26 May 2019
An Analysis of Speech Enhancement and Recognition Losses in Limited Resources Multi-talker Single Channel Audio-Visual ASR
Luca Pasa
Giovanni Morrone
Leonardo Badino
46
2
0
16 Apr 2019
Improved Speaker-Dependent Separation for CHiME-5 Challenge
Jian Wu
Yong-mei Xu
Shi-Xiong Zhang
Lianwu Chen
Meng Yu
Lei Xie
Dong Yu
16
3
0
08 Apr 2019
Time Domain Audio Visual Speech Separation
Jian Wu
Yong-mei Xu
Shi-Xiong Zhang
Lianwu Chen
Meng Yu
Lei Xie
Dong Yu
113
118
0
07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification
Suwon Shon
Hao Tang
James R. Glass
VLM
73
88
0
07 Apr 2019
Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-End Speaker Verification
Gautam Bhattacharya
A. Kannan
Md. Jahangir Alam
P. Kenny
94
60
0
07 Nov 2018
Adapting End-to-End Neural Speaker Verification to New Languages and Recording Conditions with Adversarial Training
Christoph Dann
Lihong Li
Wei Wei
83
39
0
07 Nov 2018
Fully Supervised Speaker Diarization
Aonan Zhang
Quan Wang
Zhenyao Zhu
John Paisley
Chong-Jun Wang
BDL
139
218
0
10 Oct 2018
Previous
1
2
3
4