ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04826
  4. Cited By
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned
  Spectrogram Masking
v1v2v3v4v5v6 (latest)

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

11 October 2018
Quan Wang
Hannah Muckenhirn
K. Wilson
Prashant Sridhar
Zelin Wu
J. Hershey
Rif A. Saurous
Ron J. Weiss
Ye Jia
Ignacio López Moreno
ArXiv (abs)PDFHTML

Papers citing "VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking"

43 / 193 papers shown
Title
Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Aswin Sivaraman
Minje Kim
SSL
73
11
0
06 Nov 2020
Speakerfilter-Pro: an improved target speaker extractor combines the
  time domain and frequency domain
Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain
Shulin He
Hao Li
Xueliang Zhang
31
3
0
25 Oct 2020
Speaker Separation Using Speaker Inventories and Estimated Speech
Speaker Separation Using Speaker Inventories and Estimated Speech
Peidong Wang
Zhuo Chen
DeLiang Wang
Jinyu Li
Jiawei Liu
93
11
0
20 Oct 2020
Muse: Multi-modal target speaker extraction with visual cues
Muse: Multi-modal target speaker extraction with visual cues
Zexu Pan
Ruijie Tao
Chenglin Xu
Haizhou Li
53
50
0
15 Oct 2020
Gender domain adaptation for automatic speech recognition task
Gender domain adaptation for automatic speech recognition task
Sokolov Artem
Andrey V. Savchenko
36
0
0
08 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and
  Continuous Speech Separation
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
58
90
0
04 Oct 2020
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device
  Speech Recognition
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Quan Wang
Ignacio López Moreno
Mert Saglam
K. Wilson
Alan Chiao
...
Yanzhang He
Wei Li
Jason W. Pelecanos
M. Nika
A. Gruenstein
VLM
71
86
0
09 Sep 2020
Speaker Representation Learning using Global Context Guided Channel and
  Time-Frequency Transformations
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations
Wei Xia
John H. L. Hansen
57
9
0
02 Sep 2020
End-to-End Trainable Self-Attentive Shallow Network for Text-Independent
  Speaker Verification
End-to-End Trainable Self-Attentive Shallow Network for Text-Independent Speaker Verification
Hyeonmook Park
Jungbae Park
Sang Wan Lee
20
0
0
14 Aug 2020
Textual Echo Cancellation
Textual Echo Cancellation
Shaojin Ding
Ye Jia
Ke Hu
Quan Wang
70
8
0
13 Aug 2020
MIRNet: Learning multiple identities representations in overlapped
  speech
MIRNet: Learning multiple identities representations in overlapped speech
Hyewon Han
Soo-Whan Chung
Hong-Goo Kang
69
8
0
04 Aug 2020
Version Control of Speaker Recognition Systems
Version Control of Speaker Recognition Systems
Quan Wang
Ignacio López Moreno
75
9
0
23 Jul 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
70
21
0
25 Jun 2020
SE-MelGAN -- Speaker Agnostic Rapid Speech Enhancement
SE-MelGAN -- Speaker Agnostic Rapid Speech Enhancement
Luka Chkhetiani
Levan Bejanidze
44
1
0
13 Jun 2020
Similarity-and-Independence-Aware Beamformer: Method for Target Source
  Extraction using Magnitude Spectrogram as Reference
Similarity-and-Independence-Aware Beamformer: Method for Target Source Extraction using Magnitude Spectrogram as Reference
Atsuo Hiroe
59
3
0
01 Jun 2020
Active Speakers in Context
Active Speakers in Context
Juan Carlos León Alcázar
Fabian Caba Heilbron
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbelaez
Guohao Li
72
62
0
20 May 2020
Atss-Net: Target Speaker Separation via Attention-based Neural Network
Atss-Net: Target Speaker Separation via Attention-based Neural Network
Tingle Li
Qingjian Lin
Yuanyuan Bao
Ming Li
39
38
0
19 May 2020
A Thousand Words are Worth More Than One Recording: NLP Based Speaker
  Change Point Detection
A Thousand Words are Worth More Than One Recording: NLP Based Speaker Change Point Detection
O. H. Anidjar
Chen Hajaj
A. Dvir
I. Gilad
14
1
0
18 May 2020
Multimodal Target Speech Separation with Voice and Face References
Multimodal Target Speech Separation with Voice and Face References
Leyuan Qu
C. Weber
S. Wermter
CVBM
63
19
0
17 May 2020
Target-Speaker Voice Activity Detection: a Novel Approach for
  Multi-Speaker Diarization in a Dinner Party Scenario
Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
Ivan Medennikov
M. Korenevsky
Tatiana Prisyach
Yuri Y. Khokhlov
Mariya Korenevskaya
...
Anton Mitrofanov
A. Andrusenko
Ivan Podluzhny
A. Laptev
A. Romanenko
63
205
0
14 May 2020
FaceFilter: Audio-visual speech separation using still images
FaceFilter: Audio-visual speech separation using still images
Soo-Whan Chung
Soyeon Choe
Joon Son Chung
Hong-Goo Kang
CVBM
116
67
0
14 May 2020
SpEx+: A Complete Time Domain Speaker Extraction Network
SpEx+: A Complete Time Domain Speaker Extraction Network
Meng Ge
Chenglin Xu
Longbiao Wang
Chng Eng Siong
Jianwu Dang
Haizhou Li
79
149
0
10 May 2020
SpEx: Multi-Scale Time Domain Speaker Extraction Network
SpEx: Multi-Scale Time Domain Speaker Extraction Network
Chenglin Xu
Wei Rao
Eng Siong Chng
Haizhou Li
64
173
0
17 Apr 2020
Voice Separation with an Unknown Number of Multiple Speakers
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
101
175
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
112
265
0
20 Feb 2020
Continuous speech separation: dataset and analysis
Continuous speech separation: dataset and analysis
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
109
217
0
30 Jan 2020
Improving speaker discrimination of target speech extraction with
  time-domain SpeakerBeam
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Marc Delcroix
Tsubasa Ochiai
Kateřina Žmolíková
K. Kinoshita
Naohiro Tawara
Tomohiro Nakatani
S. Araki
129
124
0
23 Jan 2020
Temporal-Spatial Neural Filter: Direction Informed End-to-End
  Multi-channel Target Speech Separation
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation
Rongzhi Gu
Yuexian Zou
60
18
0
02 Jan 2020
Audio-Visual Target Speaker Enhancement on Multi-Talker Environment
  using Event-Driven Cameras
Audio-Visual Target Speaker Enhancement on Multi-Talker Environment using Event-Driven Cameras
A. Arriandiaga
Giovanni Morrone
Luca Pasa
Leonardo Badino
Chiara Bartolozzi
40
1
0
05 Dec 2019
Sequential Multi-Frame Neural Beamforming for Speech Separation and
  Enhancement
Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement
Zhong-Qiu Wang
Hakan Erdogan
Scott Wisdom
K. Wilson
Desh Raj
Shinji Watanabe
Zhuo Chen
J. Hershey
57
1
0
18 Nov 2019
Improving Universal Sound Separation Using Sound Classification
Improving Universal Sound Separation Using Sound Classification
Efthymios Tzinis
Scott Wisdom
J. Hershey
A. Jansen
D. Ellis
VLM
82
73
0
18 Nov 2019
The sound of my voice: speaker representation loss for target voice
  separation
The sound of my voice: speaker representation loss for target voice separation
Seongkyu Mun
Soyeon Choe
Jaesung Huh
Joon Son Chung
47
16
0
06 Nov 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and
  Transfer Learning
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Pavel Denisov
Ngoc Thang Vu
55
27
0
13 Aug 2019
Personal VAD: Speaker-Conditioned Voice Activity Detection
Personal VAD: Speaker-Conditioned Voice Activity Detection
Shaojin Ding
Quan Wang
Shuo-yiin Chang
Li Wan
Ignacio López Moreno
74
75
0
12 Aug 2019
My lips are concealed: Audio-visual speech enhancement through
  obstructions
My lips are concealed: Audio-visual speech enhancement through obstructions
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
65
91
0
11 Jul 2019
Auditory Separation of a Conversation from Background via Attentional
  Gating
Auditory Separation of a Conversation from Background via Attentional Gating
Shariq Mobin
Bruno A. Olshausen
32
2
0
26 May 2019
An Analysis of Speech Enhancement and Recognition Losses in Limited
  Resources Multi-talker Single Channel Audio-Visual ASR
An Analysis of Speech Enhancement and Recognition Losses in Limited Resources Multi-talker Single Channel Audio-Visual ASR
Luca Pasa
Giovanni Morrone
Leonardo Badino
46
2
0
16 Apr 2019
Improved Speaker-Dependent Separation for CHiME-5 Challenge
Improved Speaker-Dependent Separation for CHiME-5 Challenge
Jian Wu
Yong-mei Xu
Shi-Xiong Zhang
Lianwu Chen
Meng Yu
Lei Xie
Dong Yu
16
3
0
08 Apr 2019
Time Domain Audio Visual Speech Separation
Time Domain Audio Visual Speech Separation
Jian Wu
Yong-mei Xu
Shi-Xiong Zhang
Lianwu Chen
Meng Yu
Lei Xie
Dong Yu
113
118
0
07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification
VoiceID Loss: Speech Enhancement for Speaker Verification
Suwon Shon
Hao Tang
James R. Glass
VLM
73
88
0
07 Apr 2019
Generative Adversarial Speaker Embedding Networks for Domain Robust
  End-to-End Speaker Verification
Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-End Speaker Verification
Gautam Bhattacharya
A. Kannan
Md. Jahangir Alam
P. Kenny
94
60
0
07 Nov 2018
Adapting End-to-End Neural Speaker Verification to New Languages and
  Recording Conditions with Adversarial Training
Adapting End-to-End Neural Speaker Verification to New Languages and Recording Conditions with Adversarial Training
Christoph Dann
Lihong Li
Wei Wei
83
39
0
07 Nov 2018
Fully Supervised Speaker Diarization
Fully Supervised Speaker Diarization
Aonan Zhang
Quan Wang
Zhenyao Zhu
John Paisley
Chong-Jun Wang
BDL
139
218
0
10 Oct 2018
Previous
1234