ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.11844
  4. Cited By
Densely connected multidilated convolutional networks for dense
  prediction tasks
v1v2 (latest)

Densely connected multidilated convolutional networks for dense prediction tasks

Computer Vision and Pattern Recognition (CVPR), 2020
21 November 2020
Naoya Takahashi
Yuki Mitsufuji
    3DV
ArXiv (abs)PDFHTML

Papers citing "Densely connected multidilated convolutional networks for dense prediction tasks"

36 / 36 papers shown
TD3Net: A temporal densely connected multi-dilated convolutional network for lipreading
TD3Net: A temporal densely connected multi-dilated convolutional network for lipreadingJournal of Visual Communication and Image Representation (JVCIR), 2025
B. Lee
Wooseok Shin
Sung Won Han
235
0
0
19 Jun 2025
Text-Queried Audio Source Separation via Hierarchical Modeling
Text-Queried Audio Source Separation via Hierarchical ModelingIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Xinlei Yin
Xiulian Peng
Xue Jiang
Zhiwei Xiong
Yan Lu
168
0
0
27 May 2025
Breaking the Context Bottleneck on Long Time Series Forecasting
Breaking the Context Bottleneck on Long Time Series Forecasting
Chao Ma
Yikai Hou
Xiang Li
Yinggang Sun
Haining Yu
Zhou Fang
Jiaxing Qu
AI4TS
312
0
0
21 Dec 2024
OpenSep: Leveraging Large Language Models with Textual Inversion for
  Open World Audio Separation
OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio SeparationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Tanvir Mahmud
Diana Marculescu
VLM
205
3
0
28 Sep 2024
Layer-aware TDNN: Speaker Recognition Using Multi-Layer Features from Pre-Trained Models
Layer-aware TDNN: Speaker Recognition Using Multi-Layer Features from Pre-Trained Models
Jin Sob Kim
Hyun Joon Park
Wooseok Shin
Juan Yun
Sung Won Han
SLR
453
2
0
12 Sep 2024
Spider: A Unified Framework for Context-dependent Concept Segmentation
Spider: A Unified Framework for Context-dependent Concept Segmentation
Xiaoqi Zhao
Youwei Pang
Wei Ji
Baicheng Sheng
Jiaming Zuo
Lihe Zhang
Huchuan Lu
338
14
0
02 May 2024
Weakly-supervised Audio Separation via Bi-modal Semantic Similarity
Weakly-supervised Audio Separation via Bi-modal Semantic SimilarityInternational Conference on Learning Representations (ICLR), 2024
Tanvir Mahmud
Saeed Amizadeh
K. Koishida
Diana Marculescu
AI4TS
225
5
0
02 Apr 2024
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Frequency-Adaptive Dilated Convolution for Semantic SegmentationComputer Vision and Pattern Recognition (CVPR), 2024
Linwei Chen
Lin Gu
Ying Fu
743
76
0
08 Mar 2024
Sampling-Frequency-Independent Universal Sound Separation
Sampling-Frequency-Independent Universal Sound Separation
Tomohiko Nakamura
Kohei Yatabe
135
0
0
22 Sep 2023
A Generalized Bandsplit Neural Network for Cinematic Audio Source
  Separation
A Generalized Bandsplit Neural Network for Cinematic Audio Source SeparationIEEE Open Journal of Signal Processing (IEEE Open J. Signal Process.), 2023
Karn N. Watcharasupat
Chih-Wei Wu
Yiwei Ding
Iroro Orife
Aaron J. Hipple
Aaron J. Hipple. Phillip A. Williams
Scott Kramer
Alexander Lerch
W. Wolcott
293
7
0
05 Sep 2023
Algorithms of Sampling-Frequency-Independent Layers for Non-integer
  Strides
Algorithms of Sampling-Frequency-Independent Layers for Non-integer StridesEuropean Signal Processing Conference (EUSIPCO), 2023
Kanami Imamura
Tomohiko Nakamura
Norihiro Takamune
Kohei Yatabe
Hiroshi Saruwatari
135
4
0
19 Jun 2023
The Whole Is Greater than the Sum of Its Parts: Improving DNN-based
  Music Source Separation
The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source SeparationEURASIP Journal on Audio, Speech, and Music Processing (EURASIP J. Audio Speech Music Process), 2023
Ryosuke Sawata
Naoya Takahashi
Stefan Uhlich
Shusuke Takahashi
Yuki Mitsufuji
162
0
0
13 May 2023
Towards Diverse Binary Segmentation via A Simple yet General Gated
  Network
Towards Diverse Binary Segmentation via A Simple yet General Gated NetworkInternational Journal of Computer Vision (IJCV), 2023
Xiaoqi Zhao
Youwei Pang
Lihe Zhang
Huchuan Lu
Lei Zhang
346
22
0
18 Mar 2023
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part
  Segmentation
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Xiangtai Li
Shilin Xu
Jianlong Wu
Haobo Yuan
Guangliang Cheng
Yu Tong
Zhouchen Lin
Ming-Hsuan Yang
Dacheng Tao
ViT
388
27
0
03 Jan 2023
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled
  Videos
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled VideosInternational Conference on Learning Representations (ICLR), 2022
Hao-Wen Dong
Naoya Takahashi
Yuki Mitsufuji
Julian McAuley
Taylor Berg-Kirkpatrick
VLMCLIP
261
36
0
14 Dec 2022
Robust One-Shot Singing Voice Conversion
Robust One-Shot Singing Voice Conversion
Naoya Takahashi
M. Singh
Yuki Mitsufuji
DiffM
265
9
0
20 Oct 2022
PoLyScriber: Integrated Fine-tuning of Extractor and Lyrics Transcriber
  for Polyphonic Music
PoLyScriber: Integrated Fine-tuning of Extractor and Lyrics Transcriber for Polyphonic MusicIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Xiaoxue Gao
Chitralekha Gupta
Haizhou Li
264
9
0
15 Jul 2022
Improving Semantic Segmentation in Transformers using Hierarchical
  Inter-Level Attention
Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention
Gary Leung
Jun Gao
Fangyin Wei
Sanja Fidler
189
3
0
05 Jul 2022
Multi-Task Learning with Multi-Query Transformer for Dense Prediction
Multi-Task Learning with Multi-Query Transformer for Dense Prediction
Yangyang Xu
Xiangtai Li
Haobo Yuan
Jianlong Wu
Lefei Zhang
ViT
449
64
0
28 May 2022
Few-Shot Musical Source Separation
Few-Shot Musical Source SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yu Wang
Daniel Stoller
Rachel M. Bittner
J. P. Bello
191
21
0
03 May 2022
Danna-Sep: Unite to separate them all
Danna-Sep: Unite to separate them all
Chin-Yun Yu
K. Cheuk
167
3
0
07 Dec 2021
KUIELab-MDX-Net: A Two-Stream Neural Network for Music Demixing
KUIELab-MDX-Net: A Two-Stream Neural Network for Music Demixing
Minseok Kim
Woosung Choi
Jaehwa Chung
Daewon Lee
Soonyoung Jung
210
61
0
24 Nov 2021
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event
  Localization and Detection with Microphone Arrays
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone ArraysIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Thi Ngoc Tho Nguyen
Douglas L. Jones
Karn N. Watcharasupat
Huy P Phan
W. Gan
194
53
0
16 Nov 2021
Semi-supervised Multi-task Learning for Semantics and Depth
Semi-supervised Multi-task Learning for Semantics and Depth
Yufeng Wang
Yi-Hsuan Tsai
Wei-Chih Hung
Wenrui Ding
Shuo Liu
Ming-Hsuan Yang
249
30
0
14 Oct 2021
Spatial Data Augmentation with Simulated Room Impulse Responses for
  Sound Event Localization and Detection
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Yuichiro Koyama
Kazuhide Shigemi
Masafumi Takahashi
Kazuki Shimada
Naoya Takahashi
E. Tsunoo
Shusuke Takahashi
Yuki Mitsufuji
187
15
0
13 Oct 2021
Music Source Separation with Deep Equilibrium Models
Music Source Separation with Deep Equilibrium Models
Yuichiro Koyama
Naoki Murata
Stefan Uhlich
Giorgio Fabbro
Shusuke Takahashi
Yuki Mitsufuji
276
5
0
13 Oct 2021
Spatial mixup: Directional loudness modification as data augmentation
  for sound event localization and detection
Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ricardo Falcón Pérez
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Yuki Mitsufuji
171
5
0
12 Oct 2021
Amicable examples for informed source separation
Amicable examples for informed source separationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Naoya Takahashi
Yuki Mitsufuji
AAML
161
1
0
11 Oct 2021
Source Mixing and Separation Robust Audio Steganography
Source Mixing and Separation Robust Audio Steganography
Naoya Takahashi
M. Singh
Yuki Mitsufuji
145
7
0
11 Oct 2021
End-to-End Complex-Valued Multidilated Convolutional Neural Network for
  Joint Acoustic Echo Cancellation and Noise Suppression
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression
Karn N. Watcharasupat
Thi Ngoc Tho Nguyen
W. Gan
Shengkui Zhao
Bin Ma
196
14
0
02 Oct 2021
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic
  Sound Event Localization and Detection
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
275
71
0
01 Oct 2021
Music Demixing Challenge 2021
Music Demixing Challenge 2021Frontiers in Signal Processing (FSP), 2021
Yuki Mitsufuji
Giorgio Fabbro
Stefan Uhlich
Fabian-Robert Stöter
Alexandre Défossez
Minseok Kim
Woosung Choi
Chin-Yun Yu
K. Cheuk
316
97
0
31 Aug 2021
Improving Polyphonic Sound Event Detection on Multichannel Recordings
  with the Sørensen-Dice Coefficient Loss and Transfer Learning
Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning
Karn N. Watcharasupat
Thi Ngoc Tho Nguyen
Ngoc Khanh Nguyen
Zhen Jian Lee
Douglas L. Jones
W. Gan
221
1
0
22 Jul 2021
What Makes Sound Event Localization and Detection Difficult? Insights
  from Error Analysis
What Makes Sound Event Localization and Detection Difficult? Insights from Error AnalysisWorkshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2021
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Zhen Jian Lee
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
132
7
0
22 Jul 2021
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse
  Response Simulation for Sound Event Localization and Detection
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection
Kazuki Shimada
Naoya Takahashi
Yuichiro Koyama
Shusuke Takahashi
E. Tsunoo
Masafumi Takahashi
Yuki Mitsufuji
99
28
0
21 Jun 2021
Points2Sound: From mono to binaural audio using 3D point cloud scenes
Points2Sound: From mono to binaural audio using 3D point cloud scenesEURASIP Journal on Audio, Speech, and Music Processing (EURASIP J. Audio Speech Music Process), 2021
Francesc Lluís
V. Chatziioannou
A. Hofmann
3DPC
309
7
0
26 Apr 2021
1