ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.04856
  4. Cited By
Multimodal Deep Learning

Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011
12 January 2023
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
Mingyu Kim
Christopher Marquardt
Marco Moldovan
Nadja Sauter
Juhan Nam
Rickmer Schulte
Karol Urbanczyk
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Yi Men
ArXiv (abs)PDFHTML

Papers citing "Multimodal Deep Learning"

50 / 844 papers shown
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation
  Learning
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning
Saeed Shurrab
Alejandro Guerra-Manzanares
Farah E. Shamout
251
4
0
05 Jul 2024
Optimal thresholds and algorithms for a model of multi-modal learning in high dimensions
Optimal thresholds and algorithms for a model of multi-modal learning in high dimensions
Christian Keup
Lenka Zdeborová
363
3
0
03 Jul 2024
Light-weight Fine-tuning Method for Defending Adversarial Noise in Pre-trained Medical Vision-Language Models
Light-weight Fine-tuning Method for Defending Adversarial Noise in Pre-trained Medical Vision-Language Models
Xu Han
Linghao Jin
Xuezhe Ma
Xiaofeng Liu
AAML
294
6
0
02 Jul 2024
MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance
  Optimizations
MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance Optimizations
Akash Dutta
Ali Jannesari
235
3
0
02 Jul 2024
A Survey on Mixture of Experts in Large Language Models
A Survey on Mixture of Experts in Large Language Models
Weilin Cai
Juyong Jiang
Fan Wang
Jing Tang
Sunghun Kim
Jiayi Huang
MoE
484
70
0
26 Jun 2024
Camera Model Identification Using Audio and Visual Content from Videos
Camera Model Identification Using Audio and Visual Content from Videos
Ioannis Tsingalis
Christos Korgialas
C. Kotropoulos
110
4
0
25 Jun 2024
How Intermodal Interaction Affects the Performance of Deep Multimodal
  Fusion for Mixed-Type Time Series
How Intermodal Interaction Affects the Performance of Deep Multimodal Fusion for Mixed-Type Time Series
Simon Dietz
Thomas Altstidl
Dario Zanca
Björn Eskofier
An Nguyen
AI4TS
84
1
0
21 Jun 2024
Knockout: A simple way to handle missing inputs
Knockout: A simple way to handle missing inputs
Minh Le Nguyen
Batuhan K. Karaman
Heejong Kim
Alan Q. Wang
Fengbei Liu
M. Sabuncu
OODUQCV
351
4
0
30 May 2024
OmniBind: Teach to Build Unequal-Scale Modality Interaction for
  Omni-Bind of All
OmniBind: Teach to Build Unequal-Scale Modality Interaction for Omni-Bind of All
Yuanhuiyi Lyu
Xueye Zheng
Dahun Kim
Lin Wang
271
21
0
25 May 2024
Review of deep learning models for crypto price prediction:
  implementation and evaluation
Review of deep learning models for crypto price prediction: implementation and evaluation
Jingyang Wu
Xinyi Zhang
Fangyixuan Huang
Haochen Zhou
Rohtiash Chandra
205
10
0
19 May 2024
Data Science Principles for Interpretable and Explainable AI
Data Science Principles for Interpretable and Explainable AIJournal of Data Science (JDS), 2024
Kris Sankaran
FaML
356
5
0
17 May 2024
FORESEE: Multimodal and Multi-view Representation Learning for Robust
  Prediction of Cancer Survival
FORESEE: Multimodal and Multi-view Representation Learning for Robust Prediction of Cancer Survival
Liangrui Pan
Yijun Peng
Yan Li
Yiyi Liang
Liwen Xu
Qingchun Liang
Shaoliang Peng
194
1
0
13 May 2024
Enhancing Apparent Personality Trait Analysis with Cross-Modal
  Embeddings
Enhancing Apparent Personality Trait Analysis with Cross-Modal Embeddings
Ádám Fodor
R. R. Saboundji
András Lőrincz
215
0
0
06 May 2024
3D object quality prediction for Metal Jet Printer with Multimodal
  thermal encoder
3D object quality prediction for Metal Jet Printer with Multimodal thermal encoder
R. Chen
Chen
Wenjia Zheng
Sandeep Jalui
Pavan Suri
Jun Zeng
AI4CE
125
2
0
17 Apr 2024
Deep Learning for Video-Based Assessment of Endotracheal Intubation
  Skills
Deep Learning for Video-Based Assessment of Endotracheal Intubation Skills
Jean-Paul Ainam
Erim Yanik
Rahul Rahul
Taylor Kunkes
Lora Cavuoto
Brian Clemency
Kaori Tanaka
Matthew Hackett
Jack Norfleet
S. De
179
0
0
17 Apr 2024
Multiple-Input Auto-Encoder Guided Feature Selection for IoT Intrusion Detection Systems
Multiple-Input Auto-Encoder Guided Feature Selection for IoT Intrusion Detection Systems
Phai Vu Dinh
Diep N. Nguyen
D. Hoang
Nguyen Quang Uy
E. Dutkiewicz
Son Pham Bao
215
1
0
22 Mar 2024
UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind
  Them All
UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All
Yuanhuiyi Lyu
Xueye Zheng
Jiazhou Zhou
Lin Wang
263
43
0
19 Mar 2024
A Survey of IMU Based Cross-Modal Transfer Learning in Human Activity
  Recognition
A Survey of IMU Based Cross-Modal Transfer Learning in Human Activity Recognition
Abhi Kamboj
Minh Do
217
5
0
17 Mar 2024
Joint Multimodal Transformer for Emotion Recognition in the Wild
Joint Multimodal Transformer for Emotion Recognition in the Wild
Paul Waligora
Haseeb Aslam
Osama Zeeshan
Soufiane Belharbi
A. L. Koerich
M. Pedersoli
Simon L Bacon
Eric Granger
305
23
0
15 Mar 2024
A Hierarchical Fused Quantum Fuzzy Neural Network for Image
  Classification
A Hierarchical Fused Quantum Fuzzy Neural Network for Image Classification
Shengyao Wu
Run-Ze Li
Yanqi Song
S. Qin
Qiaoyan Wen
Fei Gao
232
2
0
14 Mar 2024
Answering Diverse Questions via Text Attached with Key Audio-Visual
  Clues
Answering Diverse Questions via Text Attached with Key Audio-Visual Clues
Qilang Ye
Zitong Yu
Xin Liu
243
4
0
11 Mar 2024
Hybrid Quantum-inspired Resnet and Densenet for Pattern Recognition with
  Completeness Analysis
Hybrid Quantum-inspired Resnet and Densenet for Pattern Recognition with Completeness Analysis
Andi Chen
Hua‐Lei Yin
Zeng-Bing Chen
Shengjun Wu
181
4
0
09 Mar 2024
PREDILECT: Preferences Delineated with Zero-Shot Language-based
  Reasoning in Reinforcement Learning
PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning
Simon Holk
Daniel Marta
Iolanda Leite
226
17
0
23 Feb 2024
Boosting gets full Attention for Relational Learning
Boosting gets full Attention for Relational Learning
Mathieu Guillame-Bert
Richard Nock
LMTD
159
0
0
22 Feb 2024
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion
Xing Han
Huy Nguyen
Carl Harris
Nhat Ho
Suchi Saria
MoE
398
54
0
05 Feb 2024
Location Agnostic Adaptive Rain Precipitation Prediction using Deep
  Learning
Location Agnostic Adaptive Rain Precipitation Prediction using Deep Learning
Md Shazid Islam
Md Saydur Rahman
Md. Ehsanul Haque
Farhana Akter Tumpa
M. Hossain
A. Arabi
143
7
0
02 Feb 2024
M2CURL: Sample-Efficient Multimodal Reinforcement Learning via
  Self-Supervised Representation Learning for Robotic Manipulation
M2CURL: Sample-Efficient Multimodal Reinforcement Learning via Self-Supervised Representation Learning for Robotic Manipulation
Fotios Lygerakis
Vedant Dave
Elmar Rueckert
SSL
264
9
0
30 Jan 2024
One-Spike SNN: Single-Spike Phase Coding with Base Manipulation for
  ANN-to-SNN Conversion Loss Minimization
One-Spike SNN: Single-Spike Phase Coding with Base Manipulation for ANN-to-SNN Conversion Loss Minimization
Sangwoo Hwang
Jaeha Kung
187
15
0
30 Jan 2024
Cross-Modal Coordination Across a Diverse Set of Input Modalities
Cross-Modal Coordination Across a Diverse Set of Input Modalities
Jorge Sánchez
Rodrigo Laguna
VLM
244
0
0
29 Jan 2024
Distilling Privileged Multimodal Information for Expression Recognition
  using Optimal Transport
Distilling Privileged Multimodal Information for Expression Recognition using Optimal TransportIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2024
Haseeb Aslam
Muhammad Osama Zeeshan
Soufiane Belharbi
M. Pedersoli
A. L. Koerich
Simon L Bacon
Eric Granger
333
14
0
27 Jan 2024
Collaborative Position Reasoning Network for Referring Image
  Segmentation
Collaborative Position Reasoning Network for Referring Image Segmentation
Jianjian Cao
Beiya Dai
Yulin Li
Xiameng Qin
Jingdong Wang
299
1
0
22 Jan 2024
Uncertainty-Aware Hardware Trojan Detection Using Multimodal Deep
  Learning
Uncertainty-Aware Hardware Trojan Detection Using Multimodal Deep LearningDesign, Automation and Test in Europe (DATE), 2024
Rahul Vishwakarma
Amin Rezaei
228
6
0
15 Jan 2024
CARAT: Contrastive Feature Reconstruction and Aggregation for
  Multi-Modal Multi-Label Emotion Recognition
CARAT: Contrastive Feature Reconstruction and Aggregation for Multi-Modal Multi-Label Emotion RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2023
Cheng Peng
Ke Chen
Lidan Shou
Gang Chen
330
21
0
15 Dec 2023
On Robustness to Missing Video for Audiovisual Speech Recognition
On Robustness to Missing Video for Audiovisual Speech Recognition
Oscar Chang
Otavio Braga
H. Liao
Dmitriy Serdyuk
Olivier Siohan
287
13
0
13 Dec 2023
CLIP-QDA: An Explainable Concept Bottleneck Model
CLIP-QDA: An Explainable Concept Bottleneck Model
Rémi Kazmierczak
Eloise Berthier
Nicolas Bousquet
Andrea Passerini
466
10
0
30 Nov 2023
Automatic Detection of Alzheimer's Disease with Multi-Modal Fusion of
  Clinical MRI Scans
Automatic Detection of Alzheimer's Disease with Multi-Modal Fusion of Clinical MRI Scans
Long Chen
Liben Chen
Binfeng Xu
Wenxin Zhang
N. Razavian
131
0
0
30 Nov 2023
Multimodal Large Language Models: A Survey
Multimodal Large Language Models: A SurveyBigData Congress [Services Society] (BSS), 2023
Jiayang Wu
Wensheng Gan
Zefeng Chen
Shicheng Wan
Philip S. Yu
241
314
0
22 Nov 2023
Fuse It or Lose It: Deep Fusion for Multimodal Simulation-Based
  Inference
Fuse It or Lose It: Deep Fusion for Multimodal Simulation-Based Inference
Marvin Schmitt
Stefan T. Radev
Paul-Christian Bürkner
378
6
0
17 Nov 2023
UniCat: Crafting a Stronger Fusion Baseline for Multimodal
  Re-Identification
UniCat: Crafting a Stronger Fusion Baseline for Multimodal Re-Identification
Jennifer Crawford
Haoli Yin
Luke McDermott
Daniel Cummings
307
21
0
28 Oct 2023
MOSEL: Inference Serving Using Dynamic Modality Selection
MOSEL: Inference Serving Using Dynamic Modality SelectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bodun Hu
Le Xu
Jeongyoon Moon
N. Yadwadkar
Aditya Akella
306
5
0
27 Oct 2023
Understanding Transferable Representation Learning and Zero-shot
  Transfer in CLIP
Understanding Transferable Representation Learning and Zero-shot Transfer in CLIPInternational Conference on Learning Representations (ICLR), 2023
Zixiang Chen
Yihe Deng
Yuanzhi Li
Quanquan Gu
VLM
395
18
0
02 Oct 2023
Harnessing Diverse Data for Global Disaster Prediction: A Multimodal
  Framework
Harnessing Diverse Data for Global Disaster Prediction: A Multimodal Framework
Gengyin Liu
Huaiyang Zhong
AI4CE
96
1
0
28 Sep 2023
On the Computational Benefit of Multimodal Learning
On the Computational Benefit of Multimodal LearningInternational Conference on Algorithmic Learning Theory (ALT), 2023
Zhou Lu
229
1
0
25 Sep 2023
Multimodal Deep Learning for Scientific Imaging Interpretation
Multimodal Deep Learning for Scientific Imaging Interpretation
Abdulelah S. Alshehri
Franklin L. Lee
Shihu Wang
123
3
0
21 Sep 2023
A Theory of Multimodal Learning
A Theory of Multimodal LearningNeural Information Processing Systems (NeurIPS), 2023
Zhou Lu
235
30
0
21 Sep 2023
FRAMU: Attention-based Machine Unlearning using Federated Reinforcement
  Learning
FRAMU: Attention-based Machine Unlearning using Federated Reinforcement LearningIEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
T. Shaik
Xiaohui Tao
Lin Li
Haoran Xie
Taotao Cai
Xiaofeng Zhu
Qingyuan Li
MU
283
27
0
19 Sep 2023
Intent Detection at Scale: Tuning a Generic Model using Relevant Intents
Intent Detection at Scale: Tuning a Generic Model using Relevant IntentsInternational Conference on Machine Learning and Applications (ICMLA), 2023
Nichal Narotamo
David Aparicio
Tiago Mesquita
Mariana Almeida
VLM
249
0
0
15 Sep 2023
Enhancing multimodal cooperation via sample-level modality valuation
Enhancing multimodal cooperation via sample-level modality valuationComputer Vision and Pattern Recognition (CVPR), 2023
Yake Wei
Ruoxuan Feng
Zihe Wang
Di Hu
480
51
0
12 Sep 2023
Robust-MBDL: A Robust Multi-branch Deep Learning Based Model for
  Remaining Useful Life Prediction and Operational Condition Identification of
  Rotating Machines
Robust-MBDL: A Robust Multi-branch Deep Learning Based Model for Remaining Useful Life Prediction and Operational Condition Identification of Rotating Machines
Khoa Tran
H. Vu
L. D. Pham
N. Boudaoud
198
1
0
12 Sep 2023
Towards Contrastive Learning in Music Video Domain
Towards Contrastive Learning in Music Video Domain
Karel Veldkamp
Mariya Hendriksen
Zoltán Szlávik
Alexander Keijser
SSL
216
3
0
01 Sep 2023
Previous
123456...151617
Next
Page 3 of 17
Pageof 17