Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011
12 January 2023
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
Mingyu Kim
Christopher Marquardt
Marco Moldovan
Nadja Sauter
Juhan Nam
Rickmer Schulte
Karol Urbanczyk
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Yi Men

Papers citing "Multimodal Deep Learning"

50 / 844 papers shown
DyFuLM: An Advanced Multimodal Framework for Sentiment Analysis
Ruohan Zhou
Jiachen Yuan
Churui Yang
Wenzheng Huang
Guoyan Zhang
Shiyao Wei
Jiazhen Hu
Ning Xin
Md Maruf Hasan
54
0
0
01 Dec 2025
Advanced Data Collection Techniques in Cloud Security: A Multi-Modal Deep Learning Autoencoder Approach
Aamiruddin Syed
Mohammed Ilyas Ahmad
57
0
0
26 Nov 2025
Distilling Cross-Modal Knowledge via Feature Disentanglement
Junhong Liu
Yuan Zhang
Tao Huang
Wenchao Xu
Renyu Yang
150
0
0
25 Nov 2025
New York Smells: A Large Multimodal Dataset for Olfaction
Ege Ozguroglu
Junbang Liang
Ruoshi Liu
Mia Chiquier
Michael DeTienne
Wesley Wei Qian
Alexandra Horowitz
Andrew Owens
Carl Vondrick
109
0
0
25 Nov 2025
Solar-GECO: Perovskite Solar Cell Property Prediction with Geometric-Aware Co-Attention
Lucas Li
Jean-Baptiste Puel
Florence Carton
Dounya Barrit
Jhony H. Giraldo
130
0
0
24 Nov 2025
Transparent Early ICU Mortality Prediction with Clinical Transformer and Per-Case Modality Attribution
Alexander Bakumenko
Janine Hoelscher
Hudson Smith
81
0
0
19 Nov 2025
Reconstruction-Driven Multimodal Representation Learning for Automated Media Understanding
Yassir Benhammou
Suman Kalyan
Sujay Kumar
124
0
0
17 Nov 2025
Robust Defense Strategies for Multimodal Contrastive Learning: Efficient Fine-tuning Against Backdoor Attacks
Md. Iqbal Hossain
Afia Sajeeda
Neeresh Kumar Perla
Ming Shao
AAML
239
0
0
17 Nov 2025
Multimodal ML: Quantifying the Improvement of Calorie Estimation Through Image-Text Pairs
Arya Narang
68
1
0
12 Nov 2025
Countering Multi-modal Representation Collapse through Rank-targeted Fusion
Seulgi Kim
Kiran Kokilepersaud
Mohit Prabhushankar
Ghassan AlRegib
116
0
0
09 Nov 2025
The Algorithmic Phase Transition in Correlated Spiked Models
Zhangsong Li
240
0
0
08 Nov 2025
Modality-Aware SAM: Sharpness-Aware-Minimization Driven Gradient Modulation for Harmonized Multimodal Learning
Hossein R. Nowdeh
Jie Ji
Xiaolong Ma
Fatemeh Afghah
141
0
0
28 Oct 2025
FrogDeepSDM: Improving Frog Counting and Occurrence Prediction Using Multimodal Data and Pseudo-Absence Imputation
C. Padubidri
Pranesh Velmurugan
Andreas Lanitis
A. Kamilaris
102
0
0
22 Oct 2025
Multi-modal Co-learning for Earth Observation: Enhancing single-modality models via modality collaboration
Francisco Mena
Dino Ienco
C. Dantas
R. Interdonato
Andreas Dengel
119
1
0
22 Oct 2025
Spectral Thresholds in Correlated Spiked Models and Fundamental Limits of Partial Least Squares
Pierre Mergny
Lenka Zdeborová
106
1
0
20 Oct 2025
Graph4MM: Weaving Multimodal Learning with Structural Information
Xuying Ning
Dongqi Fu
Tianxin Wei
Wujiang Xu
Jingrui He
125
5
0
19 Oct 2025
PassREfinder-FL: Privacy-Preserving Credential Stuffing Risk Prediction via Graph-Based Federated Learning for Representing Password Reuse between Websites
Expert Systems with Applications (ESWA), 2025
Jaehan Kim
Minkyoo Song
Minjae Seo
Y. Jin
Seungwon Shin
Jinwoo Kim
96
0
0
17 Oct 2025
A Multimodal Approach to Heritage Preservation in the Context of Climate Change
David Roqui
Adèle Cormier
Nistor Grozavu
Ann Bourges
81
0
0
15 Oct 2025
Contrastive Dimension Reduction: A Systematic Review
Sam Hawke
Eric Zhang
Jiawen Chen
Didong Li
159
1
0
13 Oct 2025
Mixup Helps Understanding Multimodal Video Better
Xiaoyu Ma
Ding Ding
Hao Chen
133
0
0
13 Oct 2025
Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions
Wenyuan Zhao
Adithya Balachandran
Chao Tian
Paul Pu Liang
167
0
0
06 Oct 2025
MultiModal Action Conditioned Video Generation
Yichen Li
Antonio Torralba
VGen
185
3
0
02 Oct 2025
Creative synthesis of kinematic mechanisms
Jiong Lin
Jialong Ning
Judah Goldfeder
Hod Lipson
3DV
134
0
0
30 Sep 2025
PEARL: Performance-Enhanced Aggregated Representation Learning
Wenhui Li
Shijin Gong
Xinyu Zhang
117
0
0
29 Sep 2025
Defeating Cerberus: Concept-Guided Privacy-Leakage Mitigation in Multimodal Language Models
Boyang Zhang
Istemi Ekin Akkus
Ruichuan Chen
Alice Dethise
Klaus Satzke
Ivica Rimac
Yang Zhang
PILM
195
0
0
29 Sep 2025
InfMasking: Unleashing Synergistic Information by Contrastive Multimodal Interactions
Liangjian Wen
Qun Dai
Jianzhuang Liu
Jiangtao Zheng
Yong Dai
Dongkai Wang
Zhao Kang
Jun Wang
Z. Xu
Jiang Duan
252
0
0
28 Sep 2025
S$^3$F-Net: A Multi-Modal Approach to Medical Image Classification via Spatial-Spectral Summarizer Fusion Network
Md. Saiful Bari Siddiqui
Mohammed Imamul Hassan Bhuiyan
MedIm
102
0
0
27 Sep 2025
AudioFuse: Unified Spectral-Temporal Learning via a Hybrid ViT-1D CNN Architecture for Robust Phonocardiogram Classification
Md. Saiful Bari Siddiqui
Utsab Saha
80
0
0
27 Sep 2025
Multi-modal Bayesian Neural Network Surrogates with Conjugate Last-Layer Estimation
Ian Taylor
Juliane Mueller
Julie Bessac
93
0
0
26 Sep 2025
Causal Representation Learning from Multimodal Clinical Records under Non-Random Modality Missingness
Zihan Liang
Ziwen Pan
Ruoxuan Xiong
CML
128
0
0
21 Sep 2025
Insight-LLM: LLM-enhanced Multi-view Fusion in Insider Threat Detection
Chengyu Song
Jianming Zheng
104
1
0
01 Sep 2025
AIM: Adaptive Intra-Network Modulation for Balanced Multimodal Learning
Shu Shen
Chao Chen
Tong Zhang
237
0
0
27 Aug 2025
The next question after Turing's question: Introducing the Grow-AI test
Alexandru Tugui
ELM
119
0
0
22 Aug 2025
SPANER: Shared Prompt Aligner for Multimodal Semantic Representation
Thye Shan Ng
Caren Soyeon Han
Eun-Jung Holden
135
0
0
18 Aug 2025
Arabic Multimodal Machine Learning: Datasets, Applications, Approaches, and Challenges
Abdelhamid Haouhat
Slimane Bellaouar
A. Nehar
H. Cherroun
Ahmed Abdelali
143
1
0
17 Aug 2025
A Semi-supervised Generative Model for Incomplete Multi-view Data Integration with Missing Labels
Yiyang Shen
Weiran Wang
86
0
0
15 Aug 2025
Landmark Guided Visual Feature Extractor for Visual Speech Recognition with Limited Resource
Lei Yang
Junshan Jin
Mingyuan Zhang
Yi He
Bofan Chen
Shilin Wang
93
0
0
10 Aug 2025
Chain of Questions: Guiding Multimodal Curiosity in Language Models
Nima Iji
Kia Dashtipour
LRM
165
0
0
06 Aug 2025
Intrusion Detection in Heterogeneous Networks with Domain-Adaptive Multi-Modal Learning
Mabin Umman Varghese
Zahra Taghiyarrenani
84
0
0
05 Aug 2025
Closing the Modality Gap for Mixed Modality Search
Binxu Li
Yuhui Zhang
Xiaohan Wang
Weixin Liang
Ludwig Schmidt
Serena Yeung-Levy
VLM
133
4
0
25 Jul 2025
Principled Multimodal Representation Learning
Xiaohao Liu
Xiaobo Xia
See-Kiong Ng
Tat-Seng Chua
231
6
0
23 Jul 2025
EVOLVE-X: Embedding Fusion and Language Prompting for User Evolution Forecasting on Social Media
Ismail Hossain
Sai Puppala
Md. Jahangir Alam
Sajedul Talukder
97
0
0
21 Jul 2025
A Survey of Pun Generation: Datasets, Evaluations and Methodologies
Yuchen Su
Yonghua Zhu
Ruofan Wang
Zijian Huang
Diana Benavides-Prado
Michael J. Witbrock
187
0
0
07 Jul 2025
Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges
Sanjeda Akter
Ibne Farabi Shihab
Anuj Sharma
VLM
304
2
0
02 Jul 2025
Improving Multimodal Learning Balance and Sufficiency through Data Remixing
Xiaoyu Ma
Hao Chen
Yongjian Deng
259
4
0
13 Jun 2025
Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes
Computer Vision and Pattern Recognition (CVPR), 2025
Yiming Dou
Wonseok Oh
Yuqing Luo
Antonio Loquercio
Andrew Owens
190
0
0
11 Jun 2025
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques
Adarsh Prasad Behera
J. Champati
Roberto Morabito
Sasu Tarkoma
J. Gross
204
5
0
06 Jun 2025
Towards LLM-Centric Multimodal Fusion: A Survey on Integration Strategies and Techniques
Jisu An
Junseok Lee
Jeoungeun Lee
Yongseok Son
444
2
0
05 Jun 2025
Computational Thresholds in Multi-Modal Learning via the Spiked Matrix-Tensor Model
Hugo Tabanelli
Pierre Mergny
Lenka Zdeborová
Florent Krzakala
172
1
0
03 Jun 2025
Leveraging CLIP Encoder for Multimodal Emotion Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Yehun Song
Sunyoung Cho
VLM
176
5
0
01 Jun 2025