ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.13975
  4. Cited By
Dual-Path Transformer Network: Direct Context-Aware Modeling for
  End-to-End Monaural Speech Separation
v1v2v3 (latest)

Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation

28 July 2020
Jing-jing Chen
Qi-rong Mao
Dong Liu
ArXiv (abs)PDFHTML

Papers citing "Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation"

33 / 133 papers shown
Title
New Insights on Target Speaker Extraction
New Insights on Target Speaker Extraction
Mohamed Elminshawi
Wolfgang Mack
Srikanth Raj Chetupalli
Soumitro Chakrabarty
Emanuel Habets
94
19
0
01 Feb 2022
Self-Supervised Learning based Monaural Speech Enhancement with
  Multi-Task Pre-Training
Self-Supervised Learning based Monaural Speech Enhancement with Multi-Task Pre-Training
Yi Li
Yang Sun
S. M. Naqvi
SSL
63
0
0
21 Dec 2021
Self-Supervised Learning based Monaural Speech Enhancement with
  Complex-Cycle-Consistent
Self-Supervised Learning based Monaural Speech Enhancement with Complex-Cycle-Consistent
Yi Li
Yang Sun
S. M. Naqvi
82
1
0
21 Dec 2021
U-shaped Transformer with Frequency-Band Aware Attention for Speech
  Enhancement
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
77
30
0
11 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form
  Conversational Speech
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
100
9
0
10 Dec 2021
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel
  Neural Separation Systems
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems
Yi Luo
115
16
0
07 Dec 2021
BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable
  and Efficient Speech Enhancement
BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement
Sunwoo Kim
Minje Kim
195
6
0
17 Nov 2021
REAL-M: Towards Speech Separation on Real Mixtures
REAL-M: Towards Speech Separation on Real Mixtures
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
92
21
0
20 Oct 2021
TPARN: Triple-path Attentive Recurrent Network for Time-domain
  Multichannel Speech Enhancement
TPARN: Triple-path Attentive Recurrent Network for Time-domain Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
KELM
127
46
0
20 Oct 2021
Adapting Speech Separation to Real-World Meetings Using Mixture
  Invariant Training
Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training
Aswin Sivaraman
Scott Wisdom
Hakan Erdogan
J. Hershey
70
22
0
20 Oct 2021
Progressive Learning for Stabilizing Label Selection in Speech
  Separation with Mapping-based Method
Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method
Chenyang Gao
Yue Gu
I. Marsic
128
0
0
20 Oct 2021
Singer separation for karaoke content generation
Singer separation for karaoke content generation
Hsuan-Yu Chen
Xuan-Bo Chen
J. Jang
55
0
0
13 Oct 2021
Dual-branch Attention-In-Attention Transformer for single-channel speech
  enhancement
Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Guochen Yu
Andong Li
C. Zheng
Yinuo Guo
Yutian Wang
Hui Wang
264
92
0
13 Oct 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in
  High-order Latent Domain
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
Zengwei Yao
Wenjie Pei
Fanglin Chen
Guangming Lu
David C. Zhang
96
12
0
10 Oct 2021
Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Desh Raj
Liang Lu
Zhuo Chen
Yashesh Gaur
Jinyu Li
79
19
0
17 Sep 2021
Multi-Task Audio Source Separation
Multi-Task Audio Source Separation
Lu Zhang
Chenxing Li
Feng Deng
Xiaorui Wang
90
10
0
14 Jul 2021
TENET: A Time-reversal Enhancement Network for Noise-robust ASR
TENET: A Time-reversal Enhancement Network for Noise-robust ASR
Fu-An Chao
Shao-Wei Fan-Jiang
Bi-Cheng Yan
J. Hung
Berlin Chen
89
13
0
04 Jul 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using
  linear complexity self-attention for speech enhancement
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
126
41
0
30 Jun 2021
AvaTr: One-Shot Speaker Extraction with Transformers
AvaTr: One-Shot Speaker Extraction with Transformers
S. Hu
Md Rifat Arefin
V. Nguyen
Alish Dipani
Xaq Pitkow
A. Tolias
75
4
0
03 May 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion
  Network for Speech Enhancement
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
189
117
0
27 Apr 2021
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
Xiyun Li
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Jiaming Xu
Bo Xu
Dong Yu
66
14
0
17 Apr 2021
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement
  in the Time Domain
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Kai Wang
Bengbeng He
Weiping Zhu
156
179
0
18 Mar 2021
Tune-In: Training Under Negative Environments with Interference for
  Attention Networks Simulating Cocktail Party Effect
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Jane Polak Scowcroft
Dong Yu
81
6
0
02 Mar 2021
Sandglasset: A Light Multi-Granularity Self-attentive Network For
  Time-Domain Speech Separation
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Max W. Y. Lam
Jun Wang
Jane Polak Scowcroft
Dong Yu
AI4TS
140
54
0
01 Mar 2021
Dual-Path Modeling for Long Recording Speech Separation in Meetings
Dual-Path Modeling for Long Recording Speech Separation in Meetings
Chenda Li
Zhuo Chen
Yi Luo
Cong Han
Tianyan Zhou
K. Kinoshita
Marc Delcroix
Shinji Watanabe
Y. Qian
54
11
0
23 Feb 2021
TransMask: A Compact and Fast Speech Separation Model Based on
  Transformer
TransMask: A Compact and Fast Speech Separation Model Based on Transformer
Zining Zhang
Bingsheng He
Zhenjie Zhang
75
23
0
19 Feb 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive
  Locally Recurrent Networks
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Jane Polak Scowcroft
Dong Yu
123
31
0
13 Jan 2021
Group Communication with Context Codec for Lightweight Source Separation
Group Communication with Context Codec for Lightweight Source Separation
Yi Luo
Cong Han
N. Mesgarani
105
20
0
14 Dec 2020
A comparison of handcrafted, parameterized, and learnable features for
  speech separation
A comparison of handcrafted, parameterized, and learnable features for speech separation
Wenbo Zhu
Mou Wang
Xiao-Lei Zhang
S. Rahardja
70
4
0
29 Nov 2020
Stabilizing Label Assignment for Speech Separation by Self-supervised
  Pre-training
Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training
Sung-Feng Huang
Shun-Po Chuang
Da-Rong Liu
Yi-Chen Chen
Gene-Ping Yang
Hung-yi Lee
SSL
116
14
0
29 Oct 2020
Attention is All You Need in Speech Separation
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
186
614
0
25 Oct 2020
Towards Listening to 10 People Simultaneously: An Efficient Permutation
  Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm
Towards Listening to 10 People Simultaneously: An Efficient Permutation Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm
Hideyuki Tachibana
103
15
0
22 Oct 2020
Continuous Speech Separation with Conformer
Continuous Speech Separation with Conformer
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Jian Wu
Jinyu Li
Takuya Yoshioka
Chengyi Wang
Shujie Liu
M. Zhou
121
133
0
13 Aug 2020
Previous
123