Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011

12 January 2023

Christopher Marquardt

Papers citing "Multimodal Deep Learning"

50 / 844 papers shown

Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning

Saeed Shurrab

Alejandro Guerra-Manzanares

Farah E. Shamout

251

05 Jul 2024

Optimal thresholds and algorithms for a model of multi-modal learning in high dimensions

Christian Keup

Lenka Zdeborová

363

03 Jul 2024

Light-weight Fine-tuning Method for Defending Adversarial Noise in Pre-trained Medical Vision-Language Models

Xuezhe Ma

294

02 Jul 2024

MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance Optimizations

Akash Dutta

Ali Jannesari

235

02 Jul 2024

A Survey on Mixture of Experts in Large Language Models

484

26 Jun 2024

Camera Model Identification Using Audio and Visual Content from Videos

Ioannis Tsingalis

Christos Korgialas

C. Kotropoulos

110

25 Jun 2024

How Intermodal Interaction Affects the Performance of Deep Multimodal Fusion for Mixed-Type Time Series

Simon Dietz

An Nguyen

21 Jun 2024

Knockout: A simple way to handle missing inputs

351

30 May 2024

OmniBind: Teach to Build Unequal-Scale Modality Interaction for Omni-Bind of All

Yuanhuiyi Lyu

Xueye Zheng

Dahun Kim

Lin Wang

271

25 May 2024

Review of deep learning models for crypto price prediction: implementation and evaluation

205

19 May 2024

Data Science Principles for Interpretable and Explainable AIJournal of Data Science (JDS), 2024

Kris Sankaran

FaML

356

17 May 2024

FORESEE: Multimodal and Multi-view Representation Learning for Robust Prediction of Cancer Survival

194

13 May 2024

Enhancing Apparent Personality Trait Analysis with Cross-Modal Embeddings

Ádám Fodor

R. R. Saboundji

András Lőrincz

215

06 May 2024

3D object quality prediction for Metal Jet Printer with Multimodal thermal encoder

125

17 Apr 2024

Deep Learning for Video-Based Assessment of Endotracheal Intubation Skills

179

17 Apr 2024

Multiple-Input Auto-Encoder Guided Feature Selection for IoT Intrusion Detection Systems

215

22 Mar 2024

UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All

Yuanhuiyi Lyu

Xueye Zheng

Jiazhou Zhou

Lin Wang

263

19 Mar 2024

A Survey of IMU Based Cross-Modal Transfer Learning in Human Activity Recognition

Abhi Kamboj

Minh Do

217

17 Mar 2024

Joint Multimodal Transformer for Emotion Recognition in the Wild

305

15 Mar 2024

A Hierarchical Fused Quantum Fuzzy Neural Network for Image Classification

232

14 Mar 2024

Answering Diverse Questions via Text Attached with Key Audio-Visual Clues

Qilang Ye

Zitong Yu

Xin Liu

243

11 Mar 2024

Hybrid Quantum-inspired Resnet and Densenet for Pattern Recognition with Completeness Analysis

181

09 Mar 2024

PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning

Simon Holk

Daniel Marta

Iolanda Leite

226

23 Feb 2024

Boosting gets full Attention for Relational Learning

Mathieu Guillame-Bert

Richard Nock

LMTD

159

22 Feb 2024

FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion

Huy Nguyen

398

05 Feb 2024

Location Agnostic Adaptive Rain Precipitation Prediction using Deep Learning

143

02 Feb 2024

M2CURL: Sample-Efficient Multimodal Reinforcement Learning via Self-Supervised Representation Learning for Robotic Manipulation

264

30 Jan 2024

One-Spike SNN: Single-Spike Phase Coding with Base Manipulation for ANN-to-SNN Conversion Loss Minimization

Sangwoo Hwang

Jaeha Kung

187

30 Jan 2024

Cross-Modal Coordination Across a Diverse Set of Input Modalities

Jorge Sánchez

Rodrigo Laguna

VLM

244

29 Jan 2024

Distilling Privileged Multimodal Information for Expression Recognition using Optimal TransportIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2024

Haseeb Aslam

Muhammad Osama Zeeshan

333

27 Jan 2024

Collaborative Position Reasoning Network for Referring Image Segmentation

Jingdong Wang

299

22 Jan 2024

Uncertainty-Aware Hardware Trojan Detection Using Multimodal Deep LearningDesign, Automation and Test in Europe (DATE), 2024

Rahul Vishwakarma

Amin Rezaei

228

15 Jan 2024

CARAT: Contrastive Feature Reconstruction and Aggregation for Multi-Modal Multi-Label Emotion RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2023

330

15 Dec 2023

On Robustness to Missing Video for Audiovisual Speech Recognition

287

13 Dec 2023

CLIP-QDA: An Explainable Concept Bottleneck Model

466

30 Nov 2023

Automatic Detection of Alzheimer's Disease with Multi-Modal Fusion of Clinical MRI Scans

131

30 Nov 2023

Multimodal Large Language Models: A SurveyBigData Congress [Services Society] (BSS), 2023

Jiayang Wu

Wensheng Gan

Zefeng Chen

Shicheng Wan

Philip S. Yu

241

314

22 Nov 2023

Fuse It or Lose It: Deep Fusion for Multimodal Simulation-Based Inference

Marvin Schmitt

Stefan T. Radev

Paul-Christian Bürkner

378

17 Nov 2023

UniCat: Crafting a Stronger Fusion Baseline for Multimodal Re-Identification

307

28 Oct 2023

MOSEL: Inference Serving Using Dynamic Modality SelectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

306

27 Oct 2023

Understanding Transferable Representation Learning and Zero-shot Transfer in CLIPInternational Conference on Learning Representations (ICLR), 2023

Quanquan Gu

395

02 Oct 2023

Harnessing Diverse Data for Global Disaster Prediction: A Multimodal Framework

Gengyin Liu

Huaiyang Zhong

AI4CE

28 Sep 2023

On the Computational Benefit of Multimodal LearningInternational Conference on Algorithmic Learning Theory (ALT), 2023

Zhou Lu

229

25 Sep 2023

Multimodal Deep Learning for Scientific Imaging Interpretation

Abdulelah S. Alshehri

Franklin L. Lee

Shihu Wang

123

21 Sep 2023

A Theory of Multimodal LearningNeural Information Processing Systems (NeurIPS), 2023

Zhou Lu

235

21 Sep 2023

FRAMU: Attention-based Machine Unlearning using Federated Reinforcement LearningIEEE Transactions on Knowledge and Data Engineering (TKDE), 2023

Haoran Xie

283

19 Sep 2023

Intent Detection at Scale: Tuning a Generic Model using Relevant IntentsInternational Conference on Machine Learning and Applications (ICMLA), 2023

249

15 Sep 2023

Enhancing multimodal cooperation via sample-level modality valuationComputer Vision and Pattern Recognition (CVPR), 2023

480

12 Sep 2023

Robust-MBDL: A Robust Multi-branch Deep Learning Based Model for Remaining Useful Life Prediction and Operational Condition Identification of Rotating Machines

198

12 Sep 2023

Towards Contrastive Learning in Music Video Domain

216

01 Sep 2023