ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.03600
  4. Cited By
Hybrid Spectrogram and Waveform Source Separation

Hybrid Spectrogram and Waveform Source Separation

5 November 2021
Alexandre Défossez
ArXivPDFHTML

Papers citing "Hybrid Spectrogram and Waveform Source Separation"

50 / 95 papers shown
Title
Recognizing Ornaments in Vocal Indian Art Music with Active Annotation
Recognizing Ornaments in Vocal Indian Art Music with Active Annotation
Sumit Kumar
Parampreet Singh
Vipul Arora
31
0
0
07 May 2025
The Inverse Drum Machine: Source Separation Through Joint Transcription and Analysis-by-Synthesis
The Inverse Drum Machine: Source Separation Through Joint Transcription and Analysis-by-Synthesis
Bernardo Torres
Geoffroy Peeters
G. Richard
41
0
0
06 May 2025
Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis
Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis
Yizhong Geng
Jizhuo Xu
Zeyu Liang
Jinghan Yang
Xiaoyi Shi
Xiaoyu Shen
19
0
0
10 Apr 2025
Visual-Aware Speech Recognition for Noisy Scenarios
Visual-Aware Speech Recognition for Noisy Scenarios
Lakshmipathi Balaji
Karan Singla
26
0
0
09 Apr 2025
Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation
Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation
Max W. Y. Lam
Yijin Xing
Weiya You
Jingcheng Wu
Zongyu Yin
...
T. Zhao
Chien-Hung Liu
Xuchen Song
Yang Li
Yahui Zhou
LRM
56
2
0
25 Mar 2025
Efficient Adapter Tuning for Joint Singing Voice Beat and Downbeat Tracking with Self-supervised Learning Features
Jiajun Deng
Yaolong Ju
Jing Yang
Simon Lui
Xunying Liu
48
0
0
13 Mar 2025
Score-informed Music Source Separation: Improving Synthetic-to-real Generalization in Classical Music
Eetu Tunturi
David Diaz-Guerra
A. Politis
Tuomas Virtanen
38
0
0
10 Mar 2025
30+ Years of Source Separation Research: Achievements and Future Challenges
30+ Years of Source Separation Research: Achievements and Future Challenges
S. Araki
N. Ito
Reinhold Haeb-Umbach
G. Wichern
Zhong-Qiu Wang
Yuki Mitsufuji
AI4TS
39
0
0
21 Jan 2025
Sanidha: A Studio Quality Multi-Modal Dataset for Carnatic Music
Sanidha: A Studio Quality Multi-Modal Dataset for Carnatic Music
Venkatakrishnan Vaidyanathapuram Krishnan
Noel Alben
Anish Nair
Nathaniel Condit-Schultz
46
0
0
12 Jan 2025
Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models
Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models
Tornike Karchkhadze
M. Izadi
Shlomo Dubnov
DiffM
39
2
0
31 Dec 2024
Improving Source Extraction with Diffusion and Consistency Models
Improving Source Extraction with Diffusion and Consistency Models
Tornike Karchkhadze
M. Izadi
Shuo Zhang
DiffM
82
1
0
09 Dec 2024
AEROMamba: An efficient architecture for audio super-resolution using
  generative adversarial networks and state space models
AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models
Wallace Abreu
Luiz Wagner Pereira Biscainho
Mamba
33
0
0
11 Nov 2024
Sing-On-Your-Beat: Simple Text-Controllable Accompaniment Generations
Sing-On-Your-Beat: Simple Text-Controllable Accompaniment Generations
Quoc-Huy Trinh
Minh-Van Nguyen
Trong-Hieu Nguyen-Mau
Khoa Tran
Thanh Do
33
0
0
03 Nov 2024
Automatic Estimation of Singing Voice Musical Dynamics
Automatic Estimation of Singing Voice Musical Dynamics
Jyoti Narang
Nazif Can Tamer
Viviana De La Vega
Xavier Serra
21
0
0
27 Oct 2024
Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds
Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds
Hanbin Bae
Pavel Andreev
Azat Saginbaev
Nicholas Babaev
Won-Jun Lee
Hosang Sung
Hoon-Young Cho
25
0
0
27 Sep 2024
FruitsMusic: A Real-World Corpus of Japanese Idol-Group Songs
FruitsMusic: A Real-World Corpus of Japanese Idol-Group Songs
Hitoshi Suda
Shunsuke Yoshida
Tomohiko Nakamura
Satoru Fukayama
Jun Ogata
26
0
0
19 Sep 2024
SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source
  Separation
SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation
Jaime Garcia-Martinez
David Diaz-Guerra
A. Politis
Tuomas Virtanen
J. Carabias-Orti
P. Vera-Candeas
19
1
0
17 Sep 2024
A Two-Stage Band-Split Mamba-2 Network For Music Separation
A Two-Stage Band-Split Mamba-2 Network For Music Separation
Jinglin Bai
Yuan Fang
Jiajie Wang
Xueliang Zhang
Mamba
22
1
0
10 Sep 2024
SongCreator: Lyrics-based Universal Song Generation
SongCreator: Lyrics-based Universal Song Generation
Shun Lei
Yixuan Zhou
Boshi Tang
Max W. Y. Lam
Feng Liu
Hangyu Liu
Jingcheng Wu
Shiyin Kang
Zhiyong Wu
Helen Meng
44
4
0
09 Sep 2024
The first Cadenza challenges: using machine learning competitions to
  improve music for listeners with a hearing loss
The first Cadenza challenges: using machine learning competitions to improve music for listeners with a hearing loss
Gerardo Roa Dabike
Michael A. Akeroyd
Scott Bannister
Jon P. Barker
Trevor J. Cox
...
Jennifer Firth
S. Graetzer
Alinka Greasley
Rebecca R. Vos
W. Whitmer
14
0
0
08 Sep 2024
Mel-RoFormer for Vocal Separation and Vocal Melody Transcription
Mel-RoFormer for Vocal Separation and Vocal Melody Transcription
Ju-Chiang Wang
Wei-Tsung Lu
Jitong Chen
21
1
0
07 Sep 2024
Source Separation of Multi-source Raw Music using a Residual Quantized
  Variational Autoencoder
Source Separation of Multi-source Raw Music using a Residual Quantized Variational Autoencoder
Leonardo Berti
DRL
27
0
0
12 Aug 2024
Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Ying-Shuo Lee
Yueh-Po Peng
Jui-Te Wu
Ming Cheng
Li Su
Yi-Hsuan Yang
33
0
0
23 Jul 2024
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music
  Generation
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation
Yun-Han Lan
Wen-Yi Hsiao
Hao-Chung Cheng
Yi-Hsuan Yang
48
7
0
21 Jul 2024
From Real to Cloned Singer Identification
From Real to Cloned Singer Identification
Dorian Desblancs
Gabriel Meseguer-Brocal
Romain Hennequin
Manuel Moussallam
40
0
0
11 Jul 2024
MuDiT & MuSiT: Alignment with Colloquial Expression in
  Description-to-Song Generation
MuDiT & MuSiT: Alignment with Colloquial Expression in Description-to-Song Generation
Zihao Wang
Haoxuan Liu
Jiaxing Yu
Tao Zhang
Yan Liu
K. Zhang
58
1
0
03 Jul 2024
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond
  Four Stems
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems
Karn N. Watcharasupat
Alexander Lerch
21
1
0
26 Jun 2024
Joint Audio and Symbolic Conditioning for Temporally Controlled
  Text-to-Music Generation
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation
Or Tal
Alon Ziv
Itai Gat
Felix Kreuk
Yossi Adi
47
13
0
16 Jun 2024
Xi-Net: Transformer Based Seismic Waveform Reconstructor
Xi-Net: Transformer Based Seismic Waveform Reconstructor
Anshuman Gaharwar
P. Kulkarni
Joshua Dickey
Mubarak Shah
32
0
0
14 Jun 2024
Reconstructing the Charlie Parker Omnibook using an audio-to-score
  automatic transcription pipeline
Reconstructing the Charlie Parker Omnibook using an audio-to-score automatic transcription pipeline
Xavier Riley
Simon Dixon
27
0
0
26 May 2024
Music Enhancement with Deep Filters: A Technical Report for The ICASSP
  2024 Cadenza Challenge
Music Enhancement with Deep Filters: A Technical Report for The ICASSP 2024 Cadenza Challenge
Keren Shao
K. Chen
Shlomo Dubnov
16
2
0
17 Apr 2024
Real-time Low-latency Music Source Separation using Hybrid
  Spectrogram-TasNet
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet
Satvik Venkatesh
Arthur Benilov
Philip Coleman
Frederic Roskam
35
5
0
27 Feb 2024
MuChin: A Chinese Colloquial Description Benchmark for Evaluating
  Language Models in the Field of Music
MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music
Zihao W. Wang
Shuyu Li
Tao Zhang
Qi Wang
Pengfei Yu
Jinyang Luo
Yan Liu
Ming Xi
Kejun Zhang
40
4
0
15 Feb 2024
Binaural sound source localization using a hybrid time and frequency
  domain model
Binaural sound source localization using a hybrid time and frequency domain model
Gil Geva
O. Warusfel
Shlomo Dubnov
Tammuz Dubnov
Amir Amedi
Y. Hel-Or
11
1
0
06 Feb 2024
Resource-constrained stereo singing voice cancellation
Resource-constrained stereo singing voice cancellation
Clara Borrelli
James Rae
Dogac Basaran
Matt McVicar
M. Souden
Matthias Mauch
26
0
0
22 Jan 2024
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy
  Compression on Audio and Image Models
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models
Dan G. Jacobellis
Daniel Cummings
N. Yadwadkar
13
2
0
15 Jan 2024
Remixing Music for Hearing Aids Using Ensemble of Fine-Tuned Source
  Separators
Remixing Music for Hearing Aids Using Ensemble of Fine-Tuned Source Separators
Matthew Daly
15
2
0
11 Jan 2024
Sub-band and Full-band Interactive U-Net with DPRNN for Demixing
  Cross-talk Stereo Music
Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
Han Yin
Mou Wang
Jisheng Bai
Dongyuan Shi
Woon-Seng Gan
Jianfeng Chen
20
2
0
11 Jan 2024
DJCM: A Deep Joint Cascade Model for Singing Voice Separation and Vocal
  Pitch Estimation
DJCM: A Deep Joint Cascade Model for Singing Voice Separation and Vocal Pitch Estimation
Haojie Wei
Xueke Cao
Wenbo Xu
Tangpeng Dan
Yueguo Chen
VLM
17
2
0
08 Jan 2024
Toward Deep Drum Source Separation
Toward Deep Drum Source Separation
Alessandro Ilic Mezza
Riccardo Giampiccolo
Alberto Bernardini
Augusto Sarti
20
4
0
15 Dec 2023
Evaluating Self-supervised Speech Models on a Taiwanese Hokkien Corpus
Evaluating Self-supervised Speech Models on a Taiwanese Hokkien Corpus
Yi-Hui Chou
Kalvin Chang
Meng-Ju Wu
Winston Ou
Alice Wen-Hsin Bi
...
Iu-Tshian Phoann
Winnie Chang
Chenxuan Cui
Noel Chen
Jiatong Shi
37
3
0
06 Dec 2023
DINO-VITS: Data-Efficient Zero-Shot TTS with Self-Supervised Speaker
  Verification Loss for Noise Robustness
DINO-VITS: Data-Efficient Zero-Shot TTS with Self-Supervised Speaker Verification Loss for Noise Robustness
Vikentii Pankov
Valeria Pronina
Alexander Kuzmin
Maksim Borisov
Nikita Usoltsev
Xingshan Zeng
Alexander Golubkov
Nikolai Ermolenko
Aleksandra Shirshova
Yulia Matveeva
21
2
0
16 Nov 2023
DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Jialu Li
Junhui Li
Pu Wang
Youshan Zhang
18
4
0
30 Oct 2023
JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music
  Generation
JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation
Yao Yao
Peike Li
Boyu Chen
Alex Wang
DiffM
30
9
0
29 Oct 2023
TorchAudio 2.1: Advancing speech recognition, self-supervised learning,
  and audio processing components for PyTorch
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Jeff Hwang
Moto Hira
Caroline Chen
Xiaohui Zhang
Zhaoheng Ni
...
Yumeng Tao
Robin Scheibler
Samuele Cornell
Sean Kim
Stavros Petridis
38
22
0
27 Oct 2023
Real-time Neonatal Chest Sound Separation using Deep Learning
Real-time Neonatal Chest Sound Separation using Deep Learning
Yang Yi Poh
Ethan Grooby
Kenneth Tan
Lindsay Zhou
Arrabella King
Ashwin Ramanathan
Atul Malhotra
Mehrtash Harandi
F. Marzbanrad
26
1
0
26 Oct 2023
High-Fidelity Noise Reduction with Differentiable Signal Processing
High-Fidelity Noise Reduction with Differentiable Signal Processing
C. Steinmetz
Thomas Walther
Joshua D. Reiss
14
3
0
17 Oct 2023
The First Cadenza Signal Processing Challenge: Improving Music for Those
  With a Hearing Loss
The First Cadenza Signal Processing Challenge: Improving Music for Those With a Hearing Loss
Gerardo Roa Dabike
Scott Bannister
Jennifer Firth
S. Graetzer
Rebecca Vos
...
Jon Barker
Trevor J. Cox
Bruno Fazenda
Alinka Greasley
W. Whitmer
14
6
0
09 Oct 2023
The ICASSP SP Cadenza Challenge: Music Demixing/Remixing for Hearing
  Aids
The ICASSP SP Cadenza Challenge: Music Demixing/Remixing for Hearing Aids
Gerardo Roa Dabike
Michael A. Akeroyd
Scott Bannister
Jon Barker
Trevor J. Cox
...
Jennifer Firth
S. Graetzer
Alinka Greasley
Rebecca R. Vos
W. Whitmer
15
6
0
05 Oct 2023
Mel-Band RoFormer for Music Source Separation
Mel-Band RoFormer for Music Source Separation
Ju-Chiang Wang
Wei-Tsung Lu
Minz Won
15
4
0
03 Oct 2023
12
Next