2007.13975
v1
v2
v3 (latest)
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
28 July 2020
Jing-jing Chen
Qi-rong Mao
Dong Liu
Papers citing
"Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation"
50 / 133 papers shown
Title
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Ye-Xin Lu
Yang Ai
Zhenhua Ling
109
15
0
17 Aug 2023
Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model
Sankalpa Rijal
Rajan Neupane
Saroj Prasad Mainali
Shishir K. Regmi
Shanta Maharjan
81
0
0
29 Jul 2023
A Comprehensive Survey on Applications of Transformers for Deep Learning Tasks
Saidul Islam
Hanae Elmekki
Ahmed Elsebai
Jamal Bentahar
Najat Drawel
Gaith Rjoub
Witold Pedrycz
ViT
MedIm
123
268
0
11 Jun 2023
An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Junyu Wang
67
1
0
09 Jun 2023
Towards Solving Cocktail-Party: The First Method to Build a Realistic Dataset with Ground Truths for Speech Separation
Rawad Melhem
Assef Jafar
Oumayma Al Dakkak
76
1
0
25 May 2023
Efficient Neural Music Generation
Max W. Y. Lam
Qiao Tian
Tang-Chun Li
Zongyu Yin
Siyuan Feng
...
Mingbo Ma
Xuchen Song
Jitong Chen
Yuping Wang
Yuxuan Wang
DiffM
MGen
122
63
0
25 May 2023
ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement
Feng Dang
Qi Hu
Pengyuan Zhang
Yonghong Yan
72
2
0
15 May 2023
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations
Giovanni Morrone
Samuele Cornell
L. Serafini
Enrico Zovato
Alessio Brutti
S. Squartini
131
5
0
21 Mar 2023
A two-stage speaker extraction algorithm under adverse acoustic conditions using a single-microphone
Aviad Eisenberg
Sharon Gannot
Shlomo E. Chazan
104
2
0
13 Mar 2023
On Neural Architectures for Deep Learning-based Source Separation of Co-Channel OFDM Signals
Gary C. F. Lee
Amir Weiss
A. Lancho
Yury Polyanskiy
G. Wornell
AI4TS
132
6
0
11 Mar 2023
Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Zhaoxi Mu
Xinyu Yang
Wenjing Zhu
90
6
0
07 Mar 2023
A Multi-Stage Triple-Path Method for Speech Separation in Noisy and Reverberant Environments
Zhaoxi Mu
Xinyu Yang
Xiangyuan Yang
Wenjing Zhu
74
5
0
07 Mar 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions
Shengkui Zhao
Bin Ma
141
63
0
23 Feb 2023
DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Shuo Wang
Xiangyu Kong
Xiulian Peng
H. Movassagh
Vinod Prakash
Yan Lu
75
14
0
21 Feb 2023
Local spectral attention for full-band speech enhancement
Zhongshu Hou
Qi Hu
Kai-Jyun Chen
Jing Lu
102
0
0
11 Feb 2023
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Dongheon Lee
Jung-Woo Choi
144
33
0
15 Dec 2022
Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation
Yinhao Xu
Jian Zhou
L. Tao
H. Kwan
120
0
0
14 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
161
28
0
01 Dec 2022
JaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Tomohiko Nakamura
Shinnosuke Takamichi
Naoko Tanji
Satoru Fukayama
Hiroshi Saruwatari
125
6
0
29 Nov 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
158
162
0
22 Nov 2022
Cross-Attention is all you need: Real-Time Streaming Transformers for Personalised Speech Enhancement
Shucong Zhang
Malcolm Chadwick
Alberto Gil C. P. Ramos
S. Bhattacharya
75
5
0
08 Nov 2022
Diffusion-based Generative Speech Source Separation
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
171
55
0
31 Oct 2022
TT-Net: Dual-path transformer based sound field translation in the spherical harmonic domain
Yiwen Wang
Zijian Lan
Xihong Wu
T. Qu
61
1
0
30 Oct 2022
UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio Separation
Kashyap Patel
A. Kovalyov
Issa Panahi
84
6
0
28 Oct 2022
CasNet: Investigating Channel Robustness for Speech Separation
Fan Wang
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
67
2
0
27 Oct 2022
Individualized Conditioning and Negative Distances for Speaker Separation
Tao Sun
Nidal Abuhajar
Shuyu Gong
Zhewei Wang
Charles D. Smith
Xianhui Wang
Li Xu
Jundong Liu
VLM
67
1
0
12 Oct 2022
Music Source Separation with Band-split RNN
Yi Luo
Jianwei Yu
160
136
0
30 Sep 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Sherif Abdulatif
Ru Cao
Bin Yang
175
88
0
22 Sep 2022
MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement
Jianrong Wang
Xiaomin Li
Xuewei Li
Mei Yu
Qiang Fang
Li Liu
89
0
0
15 Sep 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
182
128
0
08 Sep 2022
SSDPT: Self-Supervised Dual-Path Transformer for Anomalous Sound Detection in Machine Condition Monitoring
Jisheng Bai
Jianfeng Chen
Mou Wang
Muhammad Saad Ayub
Qingli Yan
101
18
0
06 Aug 2022
PodcastMix: A dataset for separating music and speech in podcasts
Nico M. Schmidt
Jordi Pons
M. Miron
73
3
0
15 Jul 2022
Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction
Zhongweiyang Xu
Xulin Fan
M. Hasegawa-Johnson
87
3
0
09 Jul 2022
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation
Jian Luo
Jianzong Wang
Ning Cheng
Edward Xiao
Xulong Zhang
Jing Xiao
ViT
93
13
0
28 Jun 2022
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes
Danilo de Oliveira
Tal Peer
Timo Gerkmann
82
23
0
23 Jun 2022
An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Rahil Parikh
G. Rochette
C. Espy-Wilson
S. Shamma
UQCV
52
0
0
20 Jun 2022
Resource-Efficient Separation Transformer
Luca Della Libera
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Frédéric Lepoutre
François Grondin
VLM
112
22
0
19 Jun 2022
Feature Learning and Ensemble Pre-Tasks Based Self-Supervised Speech Denoising and Dereverberation
Yi Li
ShuangLin Li
Yang Sun
S. M. Naqvi
74
0
0
10 Jun 2022
Ultra Fast Speech Separation Model with Teacher Student Learning
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Jian Wu
Takuya Yoshioka
Shujie Liu
Jinyu Li
Xiangzhan Yu
84
14
0
27 Apr 2022
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Jiangyu Han
Yanhua Long
94
6
0
23 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System
M. Z. Ozturk
Chenshu Wu
Beibei Wang
Min Wu
K. Liu
101
23
0
14 Apr 2022
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks
Fan Wang
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
84
7
0
30 Mar 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
190
114
0
28 Mar 2022
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Xue Yang
C. Bao
77
3
0
25 Mar 2022
Improving the transferability of speech separation by meta-learning
Kuan-Po Huang
Yuan-Kuei Wu
Hung-yi Lee
91
1
0
11 Mar 2022
Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation Systems
Rahil Parikh
Ilya Kavalerov
C. Espy-Wilson
Shihab Shamma
55
3
0
08 Mar 2022
MANNER: Multi-view Attention Network for Noise Erasure
Hyun Joon Park
Byung Ha Kang
Wooseok Shin
Jin Sob Kim
S. W. Han
128
53
0
04 Mar 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
136
39
0
16 Feb 2022
Exploring Self-Attention Mechanisms for Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
Mirko Bronzi
121
28
0
06 Feb 2022
Active Audio-Visual Separation of Dynamic Sound Sources
Sagnik Majumder
Kristen Grauman
148
21
0
02 Feb 2022