Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.13341
Cited By
Neural Target Speech Extraction: An Overview
31 January 2023
Kateřina Žmolíková
Marc Delcroix
Tsubasa Ochiai
K. Kinoshita
JanHonza'' vCernocký
Dong Yu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Neural Target Speech Extraction: An Overview"
45 / 45 papers shown
Title
TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models
Junyi Peng
Takanori Ashihara
Marc Delcroix
Tsubasa Ochiai
Oldrich Plchot
Shoko Araki
J. Černocký
ELM
19
0
0
10 May 2025
Listen to Extract: Onset-Prompted Target Speaker Extraction
Pengjie Shen
Kangrui Chen
Shulin He
Pengru Chen
Shuqi Yuan
He Kong
Xueliang Zhang
Z. Wang
41
0
0
08 May 2025
SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation
Zhaoxi Mu
Xinyu Yang
Gang Wang
AuLLM
KELM
VLM
53
0
0
06 May 2025
LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models
Beilong Tang
Bang Zeng
Ming Li
AI4TS
34
0
0
10 Apr 2025
Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction
Minsu Kim
Rodrigo Mira
Honglie Chen
Stavros Petridis
M. Pantic
59
0
0
13 Mar 2025
End-to-End Multi-Microphone Speaker Extraction Using Relative Transfer Functions
Aviad Eisenberg
Sharon Gannot
Shlomo E. Chazan
38
0
0
10 Feb 2025
30+ Years of Source Separation Research: Achievements and Future Challenges
S. Araki
N. Ito
Reinhold Haeb-Umbach
G. Wichern
Zhong-Qiu Wang
Yuki Mitsufuji
AI4TS
34
0
0
21 Jan 2025
USED: Universal Speaker Extraction and Diarization
Junyi Ao
Mehmet Sinan Yildirim
Ruijie Tao
Mengyao Ge
Shuai Wang
Yan-min Qian
Haizhou Li
33
5
0
17 Jan 2025
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining
H. S. Bovbjerg
Jan Østergaard
Jesper Jensen
Zheng-Hua Tan
36
0
0
06 Jan 2025
Distance Based Single-Channel Target Speech Extraction
Runwu Shi
Benjamin Yen
Kazuhiro Nakadai
23
0
0
31 Dec 2024
STCON System for the CHiME-8 Challenge
Anton Mitrofanov
Tatiana Prisyach
Tatiana Timofeeva
Sergei Novoselov
M. Korenevsky
...
Dmitriy Miroshnichenko
Nikita Mamaev
Ilya Odegov
Olga Rudnitskaya
A. Romanenko
26
1
0
17 Oct 2024
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
Shuai Wang
Ke Zhang
Shaoxiong Lin
Junjie Li
Xuefei Wang
Meng Ge
Jianwei Yu
Yanmin Qian
Haizhou Li
21
8
0
24 Sep 2024
On the effectiveness of enrollment speech augmentation for Target Speaker Extraction
Junjie Li
Ke Zhang
Shuai Wang
Haizhou Li
Man-Wai Mak
Kong Aik Lee
19
1
0
15 Sep 2024
Interaural time difference loss for binaural target sound extraction
Carlos Hernandez-Olivan
Marc Delcroix
Tsubasa Ochiai
Naohiro Tawara
Tomohiro Nakatani
Shoko Araki
19
1
0
01 Aug 2024
Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?
Atsuo Hiroe
Katsutoshi Itoyama
Kazuhiro Nakadai
22
0
0
22 Jul 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
26
4
0
21 Jul 2024
Target conversation extraction: Source separation using turn-taking dynamics
Tuochao Chen
Qirui Wang
Bohan Wu
Malek Itani
Sefik Emre Eskimez
Takuya Yoshioka
Shyamnath Gollakota
20
4
0
15 Jul 2024
Knowledge boosting during low-latency inference
Vidya Srinivas
Malek Itani
Tuochao Chen
Sefik Emre Eskimez
Takuya Yoshioka
Shyamnath Gollakota
14
2
0
09 Jul 2024
All Neural Low-latency Directional Speech Extraction
Ashutosh Pandey
Sanha Lee
Juan Azcarreta
Daniel D. E. Wong
Buye Xu
25
2
0
05 Jul 2024
SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Hiroshi Sato
Takafumi Moriya
Masato Mimura
Shota Horiguchi
Tsubasa Ochiai
Takanori Ashihara
Atsushi Ando
Kentaro Shinayama
Marc Delcroix
30
1
0
01 Jul 2024
Binaural Selective Attention Model for Target Speaker Extraction
Hanyu Meng
Qiquan Zhang
Xiangyu Zhang
V. Sethu
Eliathamby Ambikairajah
28
1
0
18 Jun 2024
TSE-PI: Target Sound Extraction under Reverberant Environments with Pitch Information
Yiwen Wang
Xihong Wu
36
2
0
13 Jun 2024
A Near-Real-Time Processing Ego Speech Filtering Pipeline Designed for Speech Interruption During Human-Robot Interaction
Yue Li
Florian A. Kunneman
Koen V. Hindriks
13
2
0
22 May 2024
Look Once to Hear: Target Speech Hearing with Noisy Examples
Bandhav Veluri
Malek Itani
Tuochao Chen
Takuya Yoshioka
Shyamnath Gollakota
22
13
0
10 May 2024
Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
Tsubasa Ochiai
Kazuma Iwamoto
Marc Delcroix
Rintaro Ikeshita
Hiroshi Sato
Shoko Araki
Shigeru Katagiri
14
2
0
23 Apr 2024
Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy
Wenxuan Wu
Xueyuan Chen
Xixin Wu
Haizhou Li
Helen M. Meng
21
1
0
24 Mar 2024
Single-Channel Robot Ego-Speech Filtering during Human-Robot Interaction
Yue Li
Koen V. Hindriks
Florian A. Kunneman
22
2
0
05 Mar 2024
Target Speech Extraction with Pre-trained Self-supervised Learning Models
Junyi Peng
Marc Delcroix
Tsubasa Ochiai
Oldrich Plchot
Shoko Araki
J. Černocký
19
8
0
17 Feb 2024
Probing Self-supervised Learning Models with Target Speech Extraction
Junyi Peng
Marc Delcroix
Tsubasa Ochiai
Oldrich Plchot
Takanori Ashihara
Shoko Araki
J. Černocký
40
1
0
17 Feb 2024
Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience
Xilin Jiang
Cong Han
Yinghao Aaron Li
N. Mesgarani
KELM
26
4
0
06 Feb 2024
Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings
He Zhao
Hangting Chen
Jianwei Yu
Yuehai Wang
34
0
0
29 Jan 2024
Online Similarity-and-Independence-Aware Beamformer for Low-latency Target Sound Extraction
Atsuo Hiroe
17
0
0
27 Dec 2023
ML-ASPA: A Contemplation of Machine Learning-based Acoustic Signal Processing Analysis for Sounds, & Strains Emerging Technology
Ratul Ali
Aktarul Islam
Md. Shohel Rana
Saila Nasrin
Sohel Afzal Shajol
Professor Dr. A.H.M. Saifullah Sadi
11
0
0
18 Dec 2023
A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and Extraction
Kohei Saijo
Wangyou Zhang
Zhong-Qiu Wang
Shinji Watanabe
Tetsunori Kobayashi
Tetsuji Ogawa
VLM
13
6
0
12 Oct 2023
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction
Xiang Hao
Jibin Wu
Jianwei Yu
Chenglin Xu
Kay Chen Tan
19
10
0
11 Oct 2023
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Dongchao Yang
Jinchuan Tian
Xuejiao Tan
Rongjie Huang
Songxiang Liu
...
Jiang Bian
Xixin Wu
Zhou Zhao
Shinji Watanabe
Helen M. Meng
CVBM
AuLLM
20
114
0
01 Oct 2023
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech
Jun Yu Li
Ruijie Tao
Zexu Pan
Meng Ge
Shuai Wang
Haizhou Li
13
5
0
15 Sep 2023
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction
Shilong Wu
Chenxi Wang
Hang Chen
Yusheng Dai
Chenyue Zhang
...
Sabato Marco Siniscalchi
O. Scharenborg
Zhong-Qiu Wang
Jia Pan
Jianqing Gao
15
9
0
15 Sep 2023
Dual-path Transformer Based Neural Beamformer for Target Speech Extraction
Aoqi Guo
Sichong Qian
Baoxiang Li
Dazhi Gao
19
1
0
30 Aug 2023
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Xiaofei Wang
Manthan Thakker
Zhuo Chen
Naoyuki Kanda
Sefik Emre Eskimez
Sanyuan Chen
M. Tang
Shujie Liu
Jinyu Li
Takuya Yoshioka
13
79
0
14 Aug 2023
Target Speech Extraction with Conditional Diffusion Model
Naoyuki Kamo
Marc Delcroix
Tomohiro Nakatan
DiffM
18
16
0
08 Aug 2023
Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction
Aoqi Guo
Junnan Wu
Peng Gao
Wenbo Zhu
Qinwen Guo
Dazhi Gao
Yujun Wang
11
1
0
28 Jun 2023
End-to-End Joint Target and Non-Target Speakers ASR
Ryo Masumura
Naoki Makishima
Taiga Yamane
Yoshihiko Yamazaki
Saki Mizuno
...
Akihiko Takashima
Satoshi Suzuki
Takafumi Moriya
Nobukatsu Hojo
Atsushi Ando
11
5
0
04 Jun 2023
ICASSP 2023 Deep Noise Suppression Challenge
Harishchandra Dubey
A. Aazami
Vishak Gopal
Sergiy Matusevych
Sebastian Braun
...
Sefik Emre Eskimez
Manthan Thakker
H. Gamper
Takuya Yoshioka
R. Aichner
10
82
0
21 Mar 2023
Heterogeneous Target Speech Separation
Hyunjae Cho
Wonbin Jung
Junhyeok Lee
Paris Smaragdis
Sanghyun Woo
36
26
0
07 Apr 2022
1