ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.10330
  4. Cited By
One model to enhance them all: array geometry agnostic multi-channel
  personalized speech enhancement

One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement

20 October 2021
H. Taherian
Sefik Emre Eskimez
Takuya Yoshioka
Huaming Wang
Zhuo Chen
Xuedong Huang
ArXivPDFHTML

Papers citing "One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement"

16 / 16 papers shown
Title
ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion Prior
ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion Prior
Zhongweiyang Xu
Xulin Fan
Zhong-Qiu Wang
Xilin Jiang
Romit Roy Choudhury
DiffM
46
0
0
20 May 2025
Efficient Area-based and Speaker-Agnostic Source Separation
Efficient Area-based and Speaker-Agnostic Source Separation
Martin Strauss
Okan Kopuklu
26
3
0
19 Aug 2024
Joint Speaker Features Learning for Audio-visual Multichannel Speech
  Separation and Recognition
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
Guinan Li
Jiajun Deng
Youjun Chen
Mengzhe Geng
Shujie Hu
...
Zengrui Jin
Tianzi Wang
Xurong Xie
Helen Meng
Xunying Liu
VLM
29
0
0
14 Jun 2024
IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source
  Localization
IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization
Ya-Bin Wang
Bing Yang
Xiaofei Li
40
1
0
11 May 2024
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
Ju Lin
Niko Moritz
Yiteng Huang
Ruiming Xie
Ming Sun
Christian Fuegen
Frank Seide
30
4
0
18 Jan 2024
Multi-channel Conversational Speaker Separation via Neural Diarization
Multi-channel Conversational Speaker Separation via Neural Diarization
H. Taherian
DeLiang Wang
BDL
34
16
0
15 Nov 2023
Multichannel Voice Trigger Detection Based on
  Transform-average-concatenate
Multichannel Voice Trigger Detection Based on Transform-average-concatenate
Takuya Higuchi
Avamarie Brueggeman
Han Fang
Stephen Shum
16
0
0
27 Sep 2023
Neural Speech Enhancement with Very Low Algorithmic Latency and
  Complexity via Integrated Full- and Sub-Band Modeling
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
AI4TS
15
10
0
18 Apr 2023
Array Configuration-Agnostic Personalized Speech Enhancement using
  Long-Short-Term Spatial Coherence
Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence
Yicheng Hsu
Yonghan Lee
M. Bai
19
2
0
16 Nov 2022
Breaking the trade-off in personalized speech enhancement with
  cross-task knowledge distillation
Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation
H. Taherian
Sefik Emre Eskimez
Takuya Yoshioka
16
1
0
05 Nov 2022
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker
  Embeddings for Target Speaker Separation
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation
Xiaoyu Liu
Xu Li
Joan Serra
42
9
0
23 Oct 2022
Multi-channel target speech enhancement based on ERB-scaled spatial
  coherence features
Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Yicheng Hsu
Yonghan Lee
M. Bai
8
1
0
17 Jul 2022
Challenges and Opportunities in Multi-device Speech Processing
Challenges and Opportunities in Multi-device Speech Processing
G. Ciccarelli
Jarred Barber
A. Nair
Israel Cohen
Tao Zhang
25
5
0
27 Jun 2022
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement
  Network (E3Net) and Knowledge Distillation
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation
Manthan Thakker
Sefik Emre Eskimez
Takuya Yoshioka
Huaming Wang
14
28
0
02 Apr 2022
VarArray: Array-Geometry-Agnostic Continuous Speech Separation
VarArray: Array-Geometry-Agnostic Continuous Speech Separation
Takuya Yoshioka
Xiaofei Wang
Dongmei Wang
M. Tang
Zirun Zhu
Zhuo Chen
Naoyuki Kanda
17
37
0
12 Oct 2021
Interspeech 2021 Deep Noise Suppression Challenge
Interspeech 2021 Deep Noise Suppression Challenge
Chandan K. A. Reddy
Harishchandra Dubey
K. Koishida
A. Nair
Vishak Gopal
Ross Cutler
Sebastian Braun
H. Gamper
R. Aichner
Sriram Srinivasan
AI4CE
74
160
0
06 Jan 2021
1