2301.04856
Cited By
Multimodal Deep Learning
International Conference on Machine Learning (ICML), 2011
12 January 2023
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
Mingyu Kim
Christopher Marquardt
Marco Moldovan
Nadja Sauter
Juhan Nam
Rickmer Schulte
Karol Urbanczyk
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Yi Men
Papers citing
"Multimodal Deep Learning"
50 / 842 papers shown
Title
Imaginations of WALL-E : Reconstructing Experiences with an Imagination-Inspired Module for Advanced AI Systems
Zeinab Taghavi
S. Gooran
Seyed Arshan Dalili
Hamidreza Amirzadeh
Mohammad Jalal Nematbakhsh
Hossein Sameti
118
2
0
20 Aug 2023
Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
J. Wilkins
Justin Salamon
Magdalena Fuentes
J. P. Bello
Oriol Nieto
CLIP
108
6
0
17 Aug 2023
Machine Unlearning: Solutions and Challenges
IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI), 2023
Jie Xu
Zihan Wu
Cong Wang
Xiaohua Jia
MU
412
97
0
14 Aug 2023
Towards Generalist Biomedical AI
Tao Tu
Shekoofeh Azizi
Danny Driess
M. Schaekermann
Mohamed Amin
...
Yossi Matias
K. Singhal
Peter R. Florence
Alan Karthikesalingam
Vivek Natarajan
LM&MA
MedIm
AI4MH
230
394
0
26 Jul 2023
FedMEKT: Distillation-based Embedding Knowledge Transfer for Multimodal Federated Learning
Neural Networks (Neural Netw.), 2023
Huy Q. Le
Minh N. H. Nguyen
Chu Myaet Thwal
Yu Qiao
Chao Zhang
Choong Seon Hong
143
24
0
25 Jul 2023
Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos
Computer Vision and Pattern Recognition (CVPR), 2023
Sagnik Majumder
Ziad Al-Halah
Kristen Grauman
SSL
EgoV
297
8
0
10 Jul 2023
Multimodal Deep Learning for Personalized Renal Cell Carcinoma Prognosis: Integrating CT Imaging and Clinical Data
Maryamalsadat Mahootiha
H. Qadir
Jacob Bergsland
I. Balasingham
118
16
0
07 Jul 2023
Interactive Image Segmentation with Cross-Modality Vision Transformers
Kun Li
G. Vosselman
M. Yang
ViT
134
4
0
05 Jul 2023
Deep Equilibrium Multimodal Fusion
Jinhong Ni
Yalong Bai
Wei Zhang
Ting Yao
Tao Mei
203
7
0
29 Jun 2023
Employing Multimodal Machine Learning for Stress Detection
Journal of Healthcare Engineering (J Healthc Eng), 2021
Rahee Walambe
Pranav Nayak
Ashmit Bhardwaj
K. Kotecha
133
53
0
15 Jun 2023
ZeroForge: Feedforward Text-to-Shape Without 3D Supervision
Kelly O. Marshall
Minh Pham
Ameya Joshi
Anushrut Jignasu
Aditya Balu
Adarsh Krishnamurthy
A. Hegde
CLIP
124
3
0
14 Jun 2023
Modality Influence in Multimodal Machine Learning
Abdelhamid Haouhat
Slimane Bellaouar
A. Nehar
H. Cherroun
195
3
0
10 Jun 2023
Multimodal Fusion Interactions: A Study of Human and Automatic Quantification
International Conference on Multimodal Interaction (ICMI), 2023
Paul Pu Liang
Yun Cheng
Ruslan Salakhutdinov
Louis-Philippe Morency
157
10
0
07 Jun 2023
Learning Representations without Compositional Assumptions
International Conference on Machine Learning (ICML), 2023
Tennison Liu
Jeroen Berrevoets
Zhaozhi Qian
M. Schaar
OCL
SSL
158
1
0
31 May 2023
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Heqing Zou
Meng Shen
Chen Chen
Yuchen Hu
D. Rajan
Chng Eng Siong
SSL
205
26
0
16 May 2023
On Uni-Modal Feature Learning in Supervised Multi-Modal Learning
International Conference on Machine Learning (ICML), 2023
Chenzhuang Du
Jiaye Teng
Tingle Li
Yichen Liu
Tianyuan Yuan
Yue Wang
Yang Yuan
Hang Zhao
326
70
0
02 May 2023
Performance Optimization using Multimodal Modeling and Heterogeneous GNN
IEEE International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2023
Akashnil Dutta
J. Alcaraz
Ali TehraniJamsaz
E. César
A. Sikora
Ali Jannesari
153
12
0
25 Apr 2023
To Compress or Not to Compress- Self-Supervised Learning and Information Theory: A Review
Entropy (Entropy), 2023
Ravid Shwartz-Ziv
Yann LeCun
SSL
475
98
0
19 Apr 2023
Efficient Multimodal Fusion via Interactive Prompting
Computer Vision and Pattern Recognition (CVPR), 2023
Yaowei Li
Ruijie Quan
Linchao Zhu
Yezhou Yang
156
59
0
13 Apr 2023
Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation
Adriano Cardace
Pierluigi Zama Ramirez
Samuele Salti
Luigi Di Stefano
3DPC
238
16
0
06 Apr 2023
Building artificial neural circuits for domain-general cognition: a primer on brain-inspired systems-level architecture
Jascha Achterberg
Danyal Akarca
Moataz Assem
Moritz P. Heimbach
D. Astle
John Duncan
AI4CE
112
5
0
21 Mar 2023
Identifiability Results for Multimodal Contrastive Learning
International Conference on Learning Representations (ICLR), 2023
Imant Daunhawer
Alice Bizeul
Emanuele Palumbo
Alexander Marx
Julia E. Vogt
172
53
0
16 Mar 2023
Chat with the Environment: Interactive Multimodal Perception Using Large Language Models
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Xufeng Zhao
Mengdi Li
C. Weber
Muhammad Burhan Hafez
S. Wermter
LLMAG
LM&Ro
LRM
351
62
0
14 Mar 2023
Recent Advances and Applications of Machine Learning in Experimental Solid Mechanics: A Review
Applied Mechanics Reviews (AMR), 2023
Hanxun Jin
Enrui Zhang
H. Espinosa
AI4CE
412
98
0
14 Mar 2023
Robust Multimodal Fusion for Human Activity Recognition
Sanju Xaviar
Xin Yang
Omid Ardakanian
HAI
159
9
0
08 Mar 2023
A Light Weight Model for Active Speaker Detection
Computer Vision and Pattern Recognition (CVPR), 2023
Junhua Liao
Haihan Duan
Kanghui Feng
Wanbing Zhao
Yanbing Yang
Liangyin Chen
192
59
0
08 Mar 2023
Speaker Recognition in Realistic Scenario Using Multimodal Data
Saqlain Hussain Shah
M. S. Saeed
Shah Nawaz
Muhammad Haroon Yousaf
CVBM
165
12
0
25 Feb 2023
Effective Multimodal Reinforcement Learning with Modality Alignment and Importance Enhancement
Jinming Ma
Feng Wu
Yingfeng Chen
Xianpeng Ji
Yu-qiong Ding
OffRL
173
5
0
18 Feb 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Journal of Computing and Information Science in Engineering (JCISE), 2023
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
312
63
0
14 Feb 2023
Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Ryumei Nakada
Halil Ibrahim Gulluk
Zhun Deng
Wenlong Ji
James Zou
Linjun Zhang
SSL
VLM
329
48
0
13 Feb 2023
Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis
Zhu Wang
Sourav Medya
Sathya Ravi
VLM
195
1
0
11 Feb 2023
WF-UNet: Weather Fusion UNet for Precipitation Nowcasting
Christos Kaparakis
S. Mehrkanoon
130
7
0
08 Feb 2023
SwinCross: Cross-modal Swin Transformer for Head-and-Neck Tumor Segmentation in PET/CT Images
Medical Physics (Lancaster) (Med. Phys.), 2023
Gary Y. Li
Junyu Chen
Se-In Jang
Kuang Gong
Shijie Zhao
ViT
MedIm
185
19
0
08 Feb 2023
TAP: The Attention Patch for Cross-Modal Knowledge Transfer from Unlabeled Modality
Yinsong Wang
Shahin Shahrampour
159
0
0
04 Feb 2023
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications
Muhammad Arslan Manzoor
S. Albarri
Ziting Xian
Zaiqiao Meng
Preslav Nakov
Shangsong Liang
AI4TS
274
50
0
01 Feb 2023
Human Fall Detection- Multimodality Approach
Xi Wang
R. Penta
Bhavya Sehgal
D. Chen-Song
42
2
0
01 Feb 2023
Spectral Cross-Domain Neural Network with Soft-adaptive Threshold Spectral Enhancement
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Che Liu
Sibo Cheng
Weiping Ding
Rossella Arcucci
199
13
0
10 Jan 2023
Multimodal Explainability via Latent Shift applied to COVID-19 stratification
Pattern Recognition (Pattern Recogn.), 2022
V. Guarrasi
L. Tronchin
Domenico Albano
E. Faiella
Deborah Fazzini
D. Santucci
Paolo Soda
237
30
0
28 Dec 2022
A Clustering-guided Contrastive Fusion for Multi-view Representation Learning
Guanzhou Ke
Guoqing Chao
Xiaoli Wang
Chenyang Xu
Yong-Nan Zhu
Yang Yu
SSL
198
37
0
28 Dec 2022
Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program
Tiange Luo
Honglak Lee
Justin Johnson
259
6
0
25 Dec 2022
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Kai Li
Fenghua Xie
Hang Chen
K. Yuan
Xiaolin Hu
224
24
0
21 Dec 2022
MAViL: Masked Audio-Video Learners
Neural Information Processing Systems (NeurIPS), 2022
Po-Yao (Bernie) Huang
Vasu Sharma
Hu Xu
Chaitanya K. Ryali
Haoqi Fan
Yanghao Li
Shang-Wen Li
Gargi Ghosh
Jitendra Malik
Christoph Feichtenhofer
278
72
0
15 Dec 2022
See, Hear, and Feel: Smart Sensory Fusion for Robotic Manipulation
Conference on Robot Learning (CoRL), 2022
Hao Li
Yizhi Zhang
Junzhe Zhu
Shaoxiong Wang
Michelle A. Lee
Huazhe Xu
Edward H. Adelson
Li Fei-Fei
Ruohan Gao
Jiajun Wu
167
88
0
07 Dec 2022
Multimodal Query-guided Object Localization
Aditay Tripathi
Rajath R Dani
Anand Mishra
Anirban Chakraborty
170
0
0
01 Dec 2022
Multimodal Learning for Multi-Omics: A Survey
Sina Tabakhi
M. N. I. Suvon
Pegah Ahadian
Haiping Lu
228
15
0
29 Nov 2022
Touch and Go: Learning from Human-Collected Vision and Touch
Neural Information Processing Systems (NeurIPS), 2022
Fengyu Yang
Chenyang Ma
Jiacheng Zhang
Jing Zhu
Wenzhen Yuan
Andrew Owens
208
89
0
22 Nov 2022
TFormer: A throughout fusion transformer for multi-modal skin lesion diagnosis
Yilan Zhang
Feng-ying Xie
Jianqing Chen
MedIm
152
41
0
21 Nov 2022
UniMSE: Towards Unified Multimodal Sentiment Analysis and Emotion Recognition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Guimin Hu
Ting-En Lin
Yi Zhao
Guangming Lu
Yuchuan Wu
Yongbin Li
248
176
0
21 Nov 2022
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
IEEE International Conference on Computer Vision (ICCV), 2022
Xingqian Xu
Zinan Lin
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
467
239
0
15 Nov 2022
PMR: Prototypical Modal Rebalance for Multimodal Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Yunfeng Fan
Wenchao Xu
Yining Qi
Junxiao Wang
Song Guo
1.4K
143
0
14 Nov 2022