ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.04149
  4. Cited By
Joint Optimization of Masks and Deep Recurrent Neural Networks for
  Monaural Source Separation
v1v2v3v4 (latest)

Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation

13 February 2015
Po-Sen Huang
Minje Kim
M. Hasegawa-Johnson
Paris Smaragdis
ArXiv (abs)PDFHTML

Papers citing "Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation"

50 / 100 papers shown
Music Source Restoration
Music Source Restoration
Yongyi Zang
Zheqi Dai
Mark D. Plumbley
Qiuqiang Kong
175
3
0
27 May 2025
Multiple Choice Learning for Efficient Speech Separation with Many
  Speakers
Multiple Choice Learning for Efficient Speech Separation with Many Speakers
David Perera
François Derrida
Théo Mariotte
Gaël Richard
S. Essid
409
3
0
27 Nov 2024
RF Challenge: The Data-Driven Radio Frequency Signal Separation Challenge
RF Challenge: The Data-Driven Radio Frequency Signal Separation ChallengeIEEE Open Journal of the Communications Society (OJ-COMSOC), 2024
A. Lancho
Amir Weiss
Gary C. F. Lee
T. Jayashankar
Binoy G. Kurien
Yury Polyanskiy
G. Wornell
523
10
0
13 Sep 2024
Ground-roll Separation From Land Seismic Records Based on Convolutional
  Neural Network
Ground-roll Separation From Land Seismic Records Based on Convolutional Neural Network
Zhuang Jia
Wenkai Lu
Meng Zhang
Yongkang Miao
200
0
0
05 Sep 2024
Real-time Neonatal Chest Sound Separation using Deep Learning
Real-time Neonatal Chest Sound Separation using Deep Learning
Yang Yi Poh
Ethan Grooby
Kenneth Tan
Lindsay Zhou
Arrabella King
Ashwin Ramanathan
Atul Malhotra
Mehrtash Harandi
F. Marzbanrad
217
1
0
26 Oct 2023
Speech Separation based on Contrastive Learning and Deep Modularization
Speech Separation based on Contrastive Learning and Deep Modularization
Peter Ochieng
SSL
333
0
0
18 May 2023
Universal Source Separation with Weakly Labelled Data
Universal Source Separation with Weakly Labelled Data
Qiuqiang Kong
Kai Chen
Haohe Liu
Xingjian Du
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Mark D. Plumbley
233
40
0
11 May 2023
Multi-Scale Feature Fusion Transformer Network for End-to-End Single
  Channel Speech Separation
Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation
Yinhao Xu
Jian Zhou
L. Tao
H. Kwan
170
0
0
14 Dec 2022
iQuery: Instruments as Queries for Audio-Visual Sound Separation
iQuery: Instruments as Queries for Audio-Visual Sound SeparationComputer Vision and Pattern Recognition (CVPR), 2022
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
313
41
0
07 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysisArtificial Intelligence Review (Artif Intell Rev), 2022
P. Ochieng
325
39
0
01 Dec 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method
  Using Variational Autoencoder and Adversarial Training
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial TrainingIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
220
7
0
16 Nov 2022
Neural Sound Field Decomposition with Super-resolution of Sound
  Direction
Neural Sound Field Decomposition with Super-resolution of Sound Direction
Qiuqiang Kong
Shilei Liu
Junjie Shi
Xuzhou Ye
Yin Cao
Qiaoxi Zhu
Yong-mei Xu
Yuxuan Wang
203
0
0
22 Oct 2022
Music Source Separation with Band-split RNN
Music Source Separation with Band-split RNNIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Yi Luo
Jianwei Yu
320
199
0
30 Sep 2022
Data-Driven Blind Synchronization and Interference Rejection for Digital
  Communication Signals
Data-Driven Blind Synchronization and Interference Rejection for Digital Communication SignalsGlobal Communications Conference (GLOBECOM), 2022
A. Lancho
Amir Weiss
Gary C. F. Lee
Jennifer Tang
Yuheng Bu
Yury Polyanskiy
G. Wornell
213
10
0
11 Sep 2022
Exploiting Temporal Structures of Cyclostationary Signals for
  Data-Driven Single-Channel Source Separation
Exploiting Temporal Structures of Cyclostationary Signals for Data-Driven Single-Channel Source SeparationInternational Workshop on Machine Learning for Signal Processing (MLSP), 2022
Gary C. F. Lee
Amir Weiss
A. Lancho
Jennifer Tang
Yuheng Bu
Yury Polyanskiy
G. Wornell
172
8
0
22 Aug 2022
An Empirical Analysis on the Vulnerabilities of End-to-End Speech
  Segregation Models
An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation ModelsInterspeech (Interspeech), 2022
Rahil Parikh
G. Rochette
C. Espy-Wilson
S. Shamma
UQCV
140
0
0
20 Jun 2022
Improving Target Sound Extraction with Timestamp Information
Improving Target Sound Extraction with Timestamp InformationInterspeech (Interspeech), 2022
Helin Wang
Dongchao Yang
Chao Weng
Jianwei Yu
Yuexian Zou
297
15
0
02 Apr 2022
Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Perceptual Contrast Stretching on Target Feature for Speech EnhancementInterspeech (Interspeech), 2022
Rong-Yu Chao
Cheng Yu
Szu-Wei Fu
Xugang Lu
Yu Tsao
VLM
344
21
0
31 Mar 2022
Improved singing voice separation with chromagram-based pitch-aware
  remixing
Improved singing voice separation with chromagram-based pitch-aware remixingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Siyuan Yuan
Zhepei Wang
Umut Isik
Ritwik Giri
J. Valin
M. Goodwin
A. Krishnaswamy
191
13
0
28 Mar 2022
Harmonicity Plays a Critical Role in DNN Based Versus in
  Biologically-Inspired Monaural Speech Segregation Systems
Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation SystemsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rahil Parikh
Ilya Kavalerov
C. Espy-Wilson
Shihab Shamma Institute for Systems Research
157
3
0
08 Mar 2022
Time-Frequency Mask Aware Bi-directional LSTM: A Deep Learning Approach
  for Underwater Acoustic Signal Separation
Time-Frequency Mask Aware Bi-directional LSTM: A Deep Learning Approach for Underwater Acoustic Signal SeparationItalian National Conference on Sensors (INS), 2022
Jier Chen
Chang Liu
Jiawu Xie
Jie An
Nan Huang
108
19
0
09 Feb 2022
Reduction of Subjective Listening Effort for TV Broadcast Signals with
  Recurrent Neural Networks
Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural NetworksIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Nils L. Westhausen
R. Huber
Hannah Baumgartner
Ragini Sinha
J. Rennies
B. Meyer
193
11
0
02 Nov 2021
Adapting Speech Separation to Real-World Meetings Using Mixture
  Invariant Training
Adapting Speech Separation to Real-World Meetings Using Mixture Invariant TrainingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Aswin Sivaraman
Scott Wisdom
Hakan Erdogan
J. Hershey
229
23
0
20 Oct 2021
SegMix: Co-occurrence Driven Mixup for Semantic Segmentation and
  Adversarial Robustness
SegMix: Co-occurrence Driven Mixup for Semantic Segmentation and Adversarial RobustnessInternational Journal of Computer Vision (IJCV), 2021
Md. Amirul Islam
M. Kowal
Konstantinos G. Derpanis
Neil D. B. Bruce
196
8
0
23 Aug 2021
The Performance Evaluation of Attention-Based Neural ASR under Mixed
  Speech Input
The Performance Evaluation of Attention-Based Neural ASR under Mixed Speech Input
Bradley He
Martin H. Radfar
170
1
0
03 Aug 2021
Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint
  Optimization
Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint OptimizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Haici Yang
Shivani Firodiya
Nicholas J. Bryan
Minje Kim
243
9
0
28 Jul 2021
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model
  Selection
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model SelectionIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
Aswin Sivaraman
Minje Kim
181
12
0
08 May 2021
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound
  Separation
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound SeparationComputer Vision and Pattern Recognition (CVPR), 2021
Yapeng Tian
Di Hu
Chenliang Xu
ObjD
252
92
0
05 Apr 2021
Personalized Speech Enhancement through Self-Supervised Data
  Augmentation and Purification
Personalized Speech Enhancement through Self-Supervised Data Augmentation and PurificationInterspeech (Interspeech), 2021
Aswin Sivaraman
Sunwoo Kim
Minje Kim
290
25
0
05 Apr 2021
Efficient Personalized Speech Enhancement through Self-Supervised
  Learning
Efficient Personalized Speech Enhancement through Self-Supervised LearningIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2021
Aswin Sivaraman
Minje Kim
268
23
0
05 Apr 2021
CatNet: music source separation system with mix-audio augmentation
CatNet: music source separation system with mix-audio augmentation
Xuchen Song
Qiuqiang Kong
Xingjian Du
Yuxuan Wang
245
12
0
19 Feb 2021
Guided Variational Autoencoder for Speech Enhancement With a Supervised
  Classifier
Guided Variational Autoencoder for Speech Enhancement With a Supervised ClassifierIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Guillaume Carbajal
Julius Richter
Timo Gerkmann
DRLSSL
280
19
0
12 Feb 2021
DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial
  Network for Speech Enhancement
DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network for Speech EnhancementInternational Symposium on Electrical, Electronics and Information Engineering (ISEEIE), 2020
Huixiang Huang
R. Wu
Jingbiao Huang
Jucai Lin
Jun Yin
GAN
149
9
0
19 Dec 2020
Investigating Cross-Domain Losses for Speech Enhancement
Investigating Cross-Domain Losses for Speech EnhancementEuropean Signal Processing Conference (EUSIPCO), 2020
Sherif Abdulatif
Karim Armanious
Jayasankar T. Sajeev
Karim Guirguis
B. Yang
397
9
0
20 Oct 2020
Feature Binding with Category-Dependant MixUp for Semantic Segmentation
  and Adversarial Robustness
Feature Binding with Category-Dependant MixUp for Semantic Segmentation and Adversarial RobustnessBritish Machine Vision Conference (BMVC), 2020
Md. Amirul Islam
M. Kowal
Konstantinos G. Derpanis
Neil D. B. Bruce
207
7
0
13 Aug 2020
Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice
  Separation
Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation
Weitao Yuan
Bofei Dong
Shengbei Wang
M. Unoki
Wenwu Wang
194
14
0
03 Aug 2020
On the Use of Audio Fingerprinting Features for Speech Enhancement with
  Generative Adversarial Network
On the Use of Audio Fingerprinting Features for Speech Enhancement with Generative Adversarial NetworkIEEE Workshop on Signal Processing Systems (SiPS), 2020
Farnood Faraji
Yazid Attabi
B. Champagne
Weiping Zhu
191
5
0
27 Jul 2020
Dereverberation using joint estimation of dry speech signal and acoustic
  system
Dereverberation using joint estimation of dry speech signal and acoustic system
Sanna Wager
Keunwoo Choi
Simon Durand
230
4
0
24 Jul 2020
A Speech Enhancement Algorithm based on Non-negative Hidden Markov Model
  and Kullback-Leibler Divergence
A Speech Enhancement Algorithm based on Non-negative Hidden Markov Model and Kullback-Leibler Divergence
Yang Xiang
Liming Shi
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
121
2
0
30 Jun 2020
Identify Speakers in Cocktail Parties with End-to-End Attention
Identify Speakers in Cocktail Parties with End-to-End Attention
Junzhe Zhu
M. Hasegawa-Johnson
Leda Sari
155
2
0
22 May 2020
SpEx: Multi-Scale Time Domain Speaker Extraction Network
SpEx: Multi-Scale Time Domain Speaker Extraction NetworkIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Chenglin Xu
Wei Rao
Eng Siong Chng
Haizhou Li
194
210
0
17 Apr 2020
A Review of Multi-Objective Deep Learning Speech Denoising Methods
A Review of Multi-Objective Deep Learning Speech Denoising MethodsSpeech Communication (Speech Commun.), 2020
A. Azarang
N. Kehtarnavaz
204
41
0
26 Mar 2020
Source Separation with Deep Generative Priors
Source Separation with Deep Generative PriorsInternational Conference on Machine Learning (ICML), 2020
V. Jayaram
John Thickstun
349
45
0
19 Feb 2020
Source separation with weakly labelled data: An approach to
  computational auditory scene analysis
Source separation with weakly labelled data: An approach to computational auditory scene analysisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Qiuqiang Kong
Yuxuan Wang
Xuchen Song
Yin Cao
Wenwu Wang
Mark D. Plumbley
267
51
0
06 Feb 2020
MITAS: A Compressed Time-Domain Audio Separation Network with Parameter
  Sharing
MITAS: A Compressed Time-Domain Audio Separation Network with Parameter Sharing
Chao-I Tuan
Yuan-Kuei Wu
Hung-yi Lee
Yu Tsao
129
2
0
09 Dec 2019
WildMix Dataset and Spectro-Temporal Transformer Model for Monoaural
  Audio Source Separation
WildMix Dataset and Spectro-Temporal Transformer Model for Monoaural Audio Source Separation
Amir Zadeh
Tianjun Ma
Soujanya Poria
Louis-Philippe Morency
194
8
0
21 Nov 2019
End-to-end Non-Negative Autoencoders for Sound Source Separation
End-to-end Non-Negative Autoencoders for Sound Source SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Shrikant Venkataramani
Efthymios Tzinis
Paris Smaragdis
343
5
0
31 Oct 2019
Two-Step Sound Source Separation: Training on Learned Latent Targets
Two-Step Sound Source Separation: Training on Learned Latent TargetsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Efthymios Tzinis
Shrikant Venkataramani
Zhepei Wang
Y. C. Sübakan
Paris Smaragdis
272
70
0
22 Oct 2019
Modeling the Comb Filter Effect and Interaural Coherence for Binaural
  Source Separation
Modeling the Comb Filter Effect and Interaural Coherence for Binaural Source SeparationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2019
Luca Remaggi
Philip J. B. Jackson
Wenwu Wang
131
8
0
04 Oct 2019
Incremental Binarization On Recurrent Neural Networks For Single-Channel
  Source Separation
Incremental Binarization On Recurrent Neural Networks For Single-Channel Source SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Sunwoo Kim
Mrinmoy Maity
Minje Kim
MQ
170
16
0
23 Aug 2019
12
Next
Page 1 of 2