Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2203.15332
Cited By
Balanced Multimodal Learning via On-the-fly Gradient Modulation
Computer Vision and Pattern Recognition (CVPR), 2022
29 March 2022
Xiaokang Peng
Yake Wei
Andong Deng
Dong Wang
Di Hu
Re-assign community
ArXiv (abs)
PDF
HTML
Github (274★)
Papers citing
"Balanced Multimodal Learning via On-the-fly Gradient Modulation"
50 / 143 papers shown
DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning
Chengxuan Qian
Kai Han
Jing Wang
Chongwen Lyu
Rui Qian
Chongwen Lyu
Zhenlong Yuan
Zhe Liu
Zhe-Yu Liu
431
17
0
09 Mar 2025
Rebalanced Multimodal Learning with Data-aware Unimodal Sampling
Qingyuan Jiang
Zhouyang Chi
Xiao Ma
Qirong Mao
Yang Yang
Jinhui Tang
227
1
0
05 Mar 2025
Attention Bootstrapping for Multi-Modal Test-Time Adaptation
AAAI Conference on Artificial Intelligence (AAAI), 2025
Yusheng Zhao
Junyu Luo
Xiao Luo
Jinsheng Huang
Jingyang Yuan
Zhiping Xiao
Min Zhang
TTA
306
2
0
04 Mar 2025
Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion
Qingyuan Jiang
Longfei Huang
Yang Yang
298
1
0
27 Feb 2025
MIND: Modality-Informed Knowledge Distillation Framework for Multimodal Clinical Prediction Tasks
Alejandro Guerra-Manzanares
Farah E. Shamout
341
3
0
03 Feb 2025
Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing Images
Yuze Wang
Rong Xiao
Haifeng Li
Mariana Belgiu
Chao Tao
325
0
0
08 Jan 2025
Balanced Multi-view Clustering
Zhenglai Li
Jun Wang
Chang-Fu Tang
Xinzhong Zhu
Wei Zhang
Xinwang Liu
490
0
0
05 Jan 2025
Balance-aware Sequence Sampling Makes Multi-modal Learning Better
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Zhi-Hao Guan
149
0
0
01 Jan 2025
Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning
RunLin Yu
Yipu Gong
Wenrui Li
Aiwen Sun
Mengren Zheng
VLM
277
0
0
16 Dec 2024
Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation
IEEE Transactions on Image Processing (TIP), 2024
Yang Yang
Wenjuan Xi
Luping Zhou
Jinhui Tang
304
7
0
14 Dec 2024
Balancing Multimodal Training Through Game-Theoretic Regularization
Konstantinos Kontras
Thomas Strypsteen
Christos Chatzichristos
Paul P. Liang
Matthew Blaschko
M. D. Vos
404
3
0
11 Nov 2024
Classifier-guided Gradient Modulation for Enhanced Multimodal Learning
Neural Information Processing Systems (NeurIPS), 2024
Zirun Guo
Tao Jin
Jingyuan Chen
Zhou Zhao
241
25
0
03 Nov 2024
On-the-fly Modulation for Balanced Multimodal Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yake Wei
D. Hu
Henghui Du
Ji-Rong Wen
240
28
0
15 Oct 2024
Recent Advances of Multimodal Continual Learning: A Comprehensive Survey
Dianzhi Yu
Xinni Zhang
Yankai Chen
Aiwei Liu
Yifei Zhang
Philip S. Yu
Irwin King
VLM
CLL
360
30
0
07 Oct 2024
Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Minoh Jeong
Min Namgung
Min Namgung
Luan Tuyen Chau
Yao-Yi Chiang
Alfred Hero
452
3
0
02 Oct 2024
A Survey of Foundation Models for Music Understanding
Wenjun Li
Ying Cai
Ziyang Wu
Wenyi Zhang
Yifan Chen
...
Junwei Han
Bao Ge
Tianming Liu
Lin Gan
Tuo Zhang
266
3
0
15 Sep 2024
DSCLAP: Domain-Specific Contrastive Language-Audio Pre-Training
Shengqiang Liu
D. Liu
Anna Wang
Zhiyu Zhang
Jie Ying Gao
Yali Li
CLIP
VLM
134
1
0
14 Sep 2024
Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout
Anbin QI
Zhongliang Liu
Xinyong Zhou
Jinba Xiao
Fengrun Zhang
Qi Gan
Ming Tao
Gaozheng Zhang
Lu Zhang
VLM
153
10
0
11 Sep 2024
Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion
IEEE transactions on multimedia (IEEE TMM), 2024
Shaoxu Cheng
Kanglei Geng
Chiyuan He
Zihuan Qiu
Linfeng Xu
Heqian Qiu
Lanxiao Wang
Qingbo Wu
Fanman Meng
Hongliang Li
CLL
219
1
0
04 Aug 2024
Detached and Interactive Multimodal Learning
ACM Multimedia (MM), 2024
Yunfeng Fan
Wenchao Xu
Yining Qi
Junhong Liu
Song Guo
346
10
0
28 Jul 2024
Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment
Yuze Zheng
Zixuan Li
Xiangxian Li
Jinxing Liu
Yuqing Wang
Xiangxu Meng
Lei Meng
DiffM
223
2
0
26 Jul 2024
Modality-Balanced Learning for Multimedia Recommendation
Jinghao Zhang
Guofan Liu
Qiang Liu
Shu Wu
Liang Wang
151
17
0
26 Jul 2024
Balanced Multi-Relational Graph Clustering
Zhixiang Shen
Haolan He
Zhao Kang
215
12
0
23 Jul 2024
PASSION: Towards Effective Incomplete Multi-Modal Medical Image Segmentation with Imbalanced Missing Rates
Junjie Shi
Caozhi Shang
Zhaobin Sun
Li Yu
Xin Yang
Zengqiang Yan
233
20
0
20 Jul 2024
Diagnosing and Re-learning for Balanced Multimodal Learning
Yake Wei
Siwei Li
Ruoxuan Feng
Di Hu
220
34
0
12 Jul 2024
GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation
Chenxin Li
Xinyu Liu
Cheng Wang
Yifan Liu
Weihao Yu
Jing Shao
Yixuan Yuan
232
33
0
08 Jul 2024
Multimodal Classification via Modal-Aware Interactive Enhancement
Qing-Yuan Jiang
Zhouyang Chi
Yang Yang
227
3
0
05 Jul 2024
Robust Multimodal Learning via Representation Decoupling
Shicai Wei
Yang Luo
Yuji Wang
Chunbo Luo
OOD
229
13
0
05 Jul 2024
Adaptive Modality Balanced Online Knowledge Distillation for Brain-Eye-Computer based Dim Object Detection
Zixing Li
Chao Yan
Zhen Lan
Xiaojia Xiang
Han Zhou
Jun Lai
Dengqing Tang
294
2
0
02 Jul 2024
Fairness and Bias in Multimodal AI: A Survey
Tosin Adewumi
Lama Alkhaled
Namrata Gurung
G. V. Boven
Irene Pagliai
344
23
0
27 Jun 2024
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2024
Grigor Bezirganyan
Sana Sellami
Laure Berti-Équille
Sébastien Fournier
287
6
0
14 Jun 2024
MA-AVT: Modality Alignment for Parameter-Efficient Audio-Visual Transformers
Tanvir Mahmud
Shentong Mo
Yapeng Tian
Diana Marculescu
186
7
0
07 Jun 2024
Predictive Dynamic Fusion
International Conference on Machine Learning (ICML), 2024
Bing Cao
Yinan Xia
Yi Ding
Changqing Zhang
Qinghua Hu
277
23
0
07 Jun 2024
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Yake Wei
Di Hu
303
62
0
28 May 2024
EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
Yuhang Yang
Wei Zhai
Chengfeng Wang
Chengjun Yu
Yang Cao
Zheng-jun Zha
318
18
0
22 May 2024
ReconBoost: Boosting Can Achieve Modality Reconcilement
International Conference on Machine Learning (ICML), 2024
Cong Hua
Qianqian Xu
Shilong Bao
Zhiyong Yang
Qingming Huang
217
38
0
15 May 2024
Improving Multimodal Learning with Multi-Loss Gradient Modulation
British Machine Vision Conference (BMVC), 2024
Konstantinos Kontras
Christos Chatzichristos
Matthew Blaschko
M. D. Vos
211
10
0
13 May 2024
Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning
F. Sarfraz
Bahram Zonooz
Elahe Arani
CLL
202
5
0
04 May 2024
MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection
H. R. Medeiros
David Latortue
Fidel Alejandro Guerrero Peña
Eric Granger
M. Pedersoli
198
0
0
29 Apr 2024
Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Qingyang Zhang
Yake Wei
Zongbo Han
Huazhu Fu
Xi Peng
...
Qinghua Hu
Cai Xu
Jie Wen
Di Hu
Changqing Zhang
344
68
0
27 Apr 2024
Learning to Rebalance Multi-Modal Optimization by Adaptively Masking Subnetworks
Yang Yang
Hongpeng Pan
Qingjun Jiang
Yi Tian Xu
Jinghui Tang
191
20
0
12 Apr 2024
Unified Multi-modal Diagnostic Framework with Reconstruction Pre-training and Heterogeneity-combat Tuning
Yupei Zhang
Li Pan
Qiushi Yang
Tan Li
Zhen Chen
311
3
0
09 Apr 2024
Attribution Regularization for Multimodal Paradigms
Sahiti Yerramilli
Jayant Sravan Tamarapalli
Jonathan M Francis
Eric Nyberg
199
4
0
02 Apr 2024
360+x: A Panoptic Multi-modal Scene Understanding Dataset
Hao Chen
Yuqi Hou
Chenyuan Qu
Irene Testini
Xiaohan Hong
Jianbo Jiao
229
24
0
01 Apr 2024
Path-GPTOmic: A Balanced Multi-modal Learning Framework for Survival Outcome Prediction
Hongxia Wang
Yang Yang
Zhuo Zhao
Pengfei Gu
Nishchal Sapkota
Danny Z. Chen
208
7
0
18 Mar 2024
Unleashing Network Potentials for Semantic Scene Completion
Computer Vision and Pattern Recognition (CVPR), 2024
Fengyun Wang
Qianru Sun
Dong Zhang
Jinhui Tang
368
5
0
12 Mar 2024
Answering Diverse Questions via Text Attached with Key Audio-Visual Clues
Qilang Ye
Zitong Yu
Xin Liu
243
4
0
11 Mar 2024
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer
Jianjian Cao
Peng Ye
Shengze Li
Chong Yu
Yansong Tang
Jiwen Lu
Tao Chen
208
43
0
05 Mar 2024
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis
Tao Tang
Guangrun Wang
Yixing Lao
Peng Chen
Jie Liu
Liang Lin
Kaicheng Yu
Xiaodan Liang
213
20
0
27 Feb 2024
Gradient-Guided Modality Decoupling for Missing-Modality Robustness
Hao Wang
Shengda Luo
Guosheng Hu
Jianguo Zhang
231
13
0
26 Feb 2024
Previous
1
2
3
Next
Page 2 of 3