ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.12221
  4. Cited By
Modality Competition: What Makes Joint Training of Multi-modal Network
  Fail in Deep Learning? (Provably)

Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably)

International Conference on Machine Learning (ICML), 2022
23 March 2022
Yu Huang
Junyang Lin
Chang Zhou
Hongxia Yang
Longbo Huang
ArXiv (abs)PDFHTML

Papers citing "Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably)"

50 / 70 papers shown
Mitigating Modality Imbalance in Multi-modal Learning via Multi-objective Optimization
Mitigating Modality Imbalance in Multi-modal Learning via Multi-objective Optimization
Heshan Devaka Fernando
Parikshit Ram
Yi Zhou
Soham Dan
Horst Samulowitz
Nathalie Baracaldo
Tianyi Chen
222
0
0
10 Nov 2025
MILES: Modality-Informed Learning Rate Scheduler for Balancing Multimodal Learning
MILES: Modality-Informed Learning Rate Scheduler for Balancing Multimodal Learning
Alejandro Guerra-Manzanares
Farah E. Shamout
128
0
0
20 Oct 2025
MCE: Towards a General Framework for Handling Missing Modalities under Imbalanced Missing Rates
MCE: Towards a General Framework for Handling Missing Modalities under Imbalanced Missing RatesPattern Recognition (Pattern Recogn.), 2025
Binyu Zhao
Wei Zhang
Zhaonian Zou
144
0
0
12 Oct 2025
Shaping Initial State Prevents Modality Competition in Multi-modal Fusion: A Two-stage Scheduling Framework via Fast Partial Information Decomposition
Shaping Initial State Prevents Modality Competition in Multi-modal Fusion: A Two-stage Scheduling Framework via Fast Partial Information Decomposition
Jiaqi Tang
Yinsong Xu
Yang Liu
Qingchao Chen
138
0
0
25 Sep 2025
Robust Multi-Omics Integration from Incomplete Modalities Significantly Improves Prediction of Alzheimer's Disease
Robust Multi-Omics Integration from Incomplete Modalities Significantly Improves Prediction of Alzheimer's Disease
Sungjoon Park
Kyungwook Lee
Soorin Yim
Doyeong Hwang
Dongyun Kim
...
Amy Dunn
Daniel Gatti
Elissa Chesler
Kristen O'Connell
Kiyoung Kim
80
0
0
25 Sep 2025
AIM: Adaptive Intra-Network Modulation for Balanced Multimodal Learning
AIM: Adaptive Intra-Network Modulation for Balanced Multimodal Learning
Shu Shen
Chao Chen
Tong Zhang
232
0
0
27 Aug 2025
SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models
SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models
Zhenwei Tang
Difan Jiao
Blair Yang
Ashton Anderson
VLMCoGe
142
1
0
25 Aug 2025
Investigating Redundancy in Multimodal Large Language Models with Multiple Vision Encoders
Investigating Redundancy in Multimodal Large Language Models with Multiple Vision Encoders
Yizhou Wang
Song Mao
Yang Chen
Yufan Shen
Yinqiao Yan
...
Botian Shi
Guohang Yan
Zhi Yu
Xuming Hu
Ding Wang
187
3
0
04 Jul 2025
G$^{2}$D: Boosting Multimodal Learning with Gradient-Guided Distillation
G2^{2}2D: Boosting Multimodal Learning with Gradient-Guided Distillation
Mohammed Rakib
A. Bagavathi
252
0
0
26 Jun 2025
RollingQ: Reviving the Cooperation Dynamics in Multimodal Transformer
RollingQ: Reviving the Cooperation Dynamics in Multimodal Transformer
Haotian Ni
Yake Wei
Hang Liu
Gong Chen
Chong Peng
Hao Lin
Di Hu
OffRL
294
1
0
13 Jun 2025
Improving Multimodal Learning Balance and Sufficiency through Data Remixing
Improving Multimodal Learning Balance and Sufficiency through Data Remixing
Xiaoyu Ma
Hao Chen
Yongjian Deng
244
4
0
13 Jun 2025
RMMSS: Towards Advanced Robust Multi-Modal Semantic Segmentation with Hybrid Prototype Distillation and Feature Selection
RMMSS: Towards Advanced Robust Multi-Modal Semantic Segmentation with Hybrid Prototype Distillation and Feature Selection
Jiaqi Tan
Xu Zheng
Yuhang Liu
339
0
0
19 May 2025
DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis
DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis
Efthymios Georgiou
Vassilis Katsouros
Yannis Avrithis
Alexandros Potamianos
393
1
0
15 Apr 2025
Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition
Adaptive Unimodal Regulation for Balanced Multimodal Information AcquisitionComputer Vision and Pattern Recognition (CVPR), 2025
Chengxiang Huang
Yake Wei
Zequn Yang
D. Hu
284
7
0
24 Mar 2025
See-Saw Modality Balance: See Gradient, and Sew Impaired Vision-Language Balance to Mitigate Dominant Modality Bias
See-Saw Modality Balance: See Gradient, and Sew Impaired Vision-Language Balance to Mitigate Dominant Modality BiasNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Junehyoung Kwon
Mihyeon Kim
Eunju Lee
Juhwan Choi
Youngbin Kim
225
0
0
18 Mar 2025
Rebalanced Multimodal Learning with Data-aware Unimodal Sampling
Qingyuan Jiang
Zhouyang Chi
Xiao Ma
Qirong Mao
Yang Yang
Jinhui Tang
221
1
0
05 Mar 2025
DeepSuM: Deep Sufficient Modality Learning Framework
Zhe Gao
Jian Huang
Ting Li
Xueqin Wang
147
0
0
03 Mar 2025
Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion
Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion
Qingyuan Jiang
Longfei Huang
Yang Yang
276
1
0
27 Feb 2025
MIND: Modality-Informed Knowledge Distillation Framework for Multimodal Clinical Prediction Tasks
MIND: Modality-Informed Knowledge Distillation Framework for Multimodal Clinical Prediction Tasks
Alejandro Guerra-Manzanares
Farah E. Shamout
333
3
0
03 Feb 2025
Balanced Multi-view Clustering
Balanced Multi-view Clustering
Zhenglai Li
Jun Wang
Chang-Fu Tang
Xinzhong Zhu
Wei Zhang
Xinwang Liu
457
0
0
05 Jan 2025
Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot
  Learning
Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning
RunLin Yu
Yipu Gong
Wenrui Li
Aiwen Sun
Mengren Zheng
VLM
266
0
0
16 Dec 2024
Rebalanced Vision-Language Retrieval Considering Structure-Aware
  Distillation
Rebalanced Vision-Language Retrieval Considering Structure-Aware DistillationIEEE Transactions on Image Processing (TIP), 2024
Yang Yang
Wenjuan Xi
Luping Zhou
Jinhui Tang
294
7
0
14 Dec 2024
Multimodal Integration of Longitudinal Noninvasive Diagnostics for Survival Prediction in Immunotherapy Using Deep Learning
Multimodal Integration of Longitudinal Noninvasive Diagnostics for Survival Prediction in Immunotherapy Using Deep Learning
Melda Yeghaian
Zuhir Bodalal
Daan van den Broek
John B A G Haanen
Regina G H Beets-Tan
Stefano Trebeschi
Marcel A J van Gerven
309
2
0
27 Nov 2024
Balancing Multimodal Training Through Game-Theoretic Regularization
Balancing Multimodal Training Through Game-Theoretic Regularization
Konstantinos Kontras
Thomas Strypsteen
Christos Chatzichristos
Paul P. Liang
Matthew Blaschko
M. D. Vos
396
3
0
11 Nov 2024
Classifier-guided Gradient Modulation for Enhanced Multimodal Learning
Classifier-guided Gradient Modulation for Enhanced Multimodal LearningNeural Information Processing Systems (NeurIPS), 2024
Zirun Guo
Tao Jin
Jingyuan Chen
Zhou Zhao
231
23
0
03 Nov 2024
On-the-fly Modulation for Balanced Multimodal Learning
On-the-fly Modulation for Balanced Multimodal LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yake Wei
D. Hu
Henghui Du
Ji-Rong Wen
233
28
0
15 Oct 2024
MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven
  Universal Model
MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal Model
Yiwen Ye
Ziyang Chen
Jianpeng Zhang
Yutong Xie
Yong Xia
MedIm
133
8
0
08 Oct 2024
Investigating the Impact of Model Complexity in Large Language Models
Investigating the Impact of Model Complexity in Large Language Models
Jing Luo
Huiyuan Wang
Weiran Huang
217
0
0
01 Oct 2024
Early Joint Learning of Emotion Information Makes MultiModal Model
  Understand You Better
Early Joint Learning of Emotion Information Makes MultiModal Model Understand You Better
Mengying Ge
Mingyang Li
Dongkai Tang
Pengbo Li
Kuo Liu
Shuhao Deng
Songbai Pu
Liu Liu
Yang Song
Tao Zhang
225
7
0
12 Sep 2024
Multimodal Emotion Recognition with Vision-language Prompting and
  Modality Dropout
Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout
Anbin QI
Zhongliang Liu
Xinyong Zhou
Jinba Xiao
Fengrun Zhang
Qi Gan
Ming Tao
Gaozheng Zhang
Lu Zhang
VLM
151
10
0
11 Sep 2024
Audio-Guided Fusion Techniques for Multimodal Emotion Analysis
Audio-Guided Fusion Techniques for Multimodal Emotion Analysis
Pujin Shi
Fei Gao
242
4
0
08 Sep 2024
Cross-Modality Clustering-based Self-Labeling for Multimodal Data
  Classification
Cross-Modality Clustering-based Self-Labeling for Multimodal Data Classification
P. Zyblewski
Leandro L. Minku
199
1
0
05 Aug 2024
Detached and Interactive Multimodal Learning
Detached and Interactive Multimodal LearningACM Multimedia (MM), 2024
Yunfeng Fan
Wenchao Xu
Yining Qi
Junhong Liu
Song Guo
345
9
0
28 Jul 2024
Hierarchical and Decoupled BEV Perception Learning Framework for
  Autonomous Driving
Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving
Yuqi Dai
Jian Sun
Shengbo Eben Li
Qing Xu
Jianqiang Wang
Lei He
Keqiang Li
284
3
0
17 Jul 2024
Diagnosing and Re-learning for Balanced Multimodal Learning
Diagnosing and Re-learning for Balanced Multimodal Learning
Yake Wei
Siwei Li
Ruoxuan Feng
Di Hu
214
34
0
12 Jul 2024
Enhance the Robustness of Text-Centric Multimodal Alignments
Enhance the Robustness of Text-Centric Multimodal Alignments
Ting-Yu Yen
Yun-Da Tsai
Keng-Te Liao
Shou-De Lin
256
4
0
06 Jul 2024
Multimodal Data Integration for Precision Oncology: Challenges and
  Future Directions
Multimodal Data Integration for Precision Oncology: Challenges and Future Directions
Huajun Zhou
Fengtao Zhou
Chenyu Zhao
Yingxue Xu
Luyang Luo
Hao Chen
304
18
0
28 Jun 2024
Generalist Multimodal AI: A Review of Architectures, Challenges and
  Opportunities
Generalist Multimodal AI: A Review of Architectures, Challenges and Opportunities
Sai Munikoti
Ian Stewart
Sameera Horawalavithana
Henry Kvinge
Tegan H. Emerson
Sandra E Thompson
Karl Pazdernik
244
4
0
08 Jun 2024
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Yake Wei
Di Hu
295
60
0
28 May 2024
Mitigating Noisy Correspondence by Geometrical Structure Consistency
  Learning
Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning
Zihua Zhao
Mengxi Chen
Tianjie Dai
Jiangchao Yao
Bo han
Ya Zhang
Yanfeng Wang
NoLa
207
10
0
27 May 2024
ReconBoost: Boosting Can Achieve Modality Reconcilement
ReconBoost: Boosting Can Achieve Modality ReconcilementInternational Conference on Machine Learning (ICML), 2024
Cong Hua
Qianqian Xu
Shilong Bao
Zhiyong Yang
Qingming Huang
204
38
0
15 May 2024
Improving Multimodal Learning with Multi-Loss Gradient Modulation
Improving Multimodal Learning with Multi-Loss Gradient ModulationBritish Machine Vision Conference (BMVC), 2024
Konstantinos Kontras
Christos Chatzichristos
Matthew Blaschko
M. D. Vos
210
10
0
13 May 2024
Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Qingyang Zhang
Yake Wei
Zongbo Han
Huazhu Fu
Xi Peng
...
Qinghua Hu
Cai Xu
Jie Wen
Di Hu
Changqing Zhang
311
61
0
27 Apr 2024
Learning to Rebalance Multi-Modal Optimization by Adaptively Masking
  Subnetworks
Learning to Rebalance Multi-Modal Optimization by Adaptively Masking Subnetworks
Yang Yang
Hongpeng Pan
Qingjun Jiang
Yi Tian Xu
Jinghui Tang
186
19
0
12 Apr 2024
Gradient-Guided Modality Decoupling for Missing-Modality Robustness
Gradient-Guided Modality Decoupling for Missing-Modality Robustness
Hao Wang
Shengda Luo
Guosheng Hu
Jianguo Zhang
209
13
0
26 Feb 2024
Can Text-to-image Model Assist Multi-modal Learning for Visual
  Recognition with Visual Modality Missing?
Can Text-to-image Model Assist Multi-modal Learning for Visual Recognition with Visual Modality Missing?
Tiantian Feng
Daniel Yang
Digbalay Bose
Shrikanth Narayanan
274
6
0
14 Feb 2024
Enhancing ID and Text Fusion via Alternative Training in Session-based
  Recommendation
Enhancing ID and Text Fusion via Alternative Training in Session-based Recommendation
Juanhui Li
Haoyu Han
Zhikai Chen
Harry Shomer
Wei Jin
Amin Javari
Shucheng Zhou
195
1
0
14 Feb 2024
Quantifying and Enhancing Multi-modal Robustness with Modality
  Preference
Quantifying and Enhancing Multi-modal Robustness with Modality Preference
Zequn Yang
Yake Wei
Ce Liang
Di Hu
AAML
324
22
0
09 Feb 2024
PowMix: A Versatile Regularizer for Multimodal Sentiment Analysis
PowMix: A Versatile Regularizer for Multimodal Sentiment Analysis
Efthymios Georgiou
Yannis Avrithis
Alexandros Potamianos
162
1
0
19 Dec 2023
Understanding Unimodal Bias in Multimodal Deep Linear Networks
Understanding Unimodal Bias in Multimodal Deep Linear NetworksInternational Conference on Machine Learning (ICML), 2023
Yedi Zhang
Peter E. Latham
Andrew Saxe
272
15
0
01 Dec 2023
12
Next