Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1705.09406
Cited By
v1
v2 (latest)
Multimodal Machine Learning: A Survey and Taxonomy
26 May 2017
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multimodal Machine Learning: A Survey and Taxonomy"
50 / 941 papers shown
M2CURL: Sample-Efficient Multimodal Reinforcement Learning via Self-Supervised Representation Learning for Robotic Manipulation
Fotios Lygerakis
Vedant Dave
Elmar Rueckert
SSL
255
9
0
30 Jan 2024
Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models
Weijiao Zhang
Jindong Han
Zhao Xu
Hang Ni
Hao Liu
Hui Xiong
Hui Xiong
AI4CE
555
26
0
30 Jan 2024
Communication-Efficient Multimodal Federated Learning: Joint Modality and Client Selection
Liangqi Yuan
Dong-Jun Han
Su Wang
Devesh Upadhyay
Christopher G. Brinton
200
20
0
30 Jan 2024
Shabari: Delayed Decision-Making for Faster and Efficient Serverless Functions
Prasoon Sinha
Kostis Kaffes
N. Yadwadkar
222
1
0
16 Jan 2024
TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Taeyang Yun
Hyunkuk Lim
Jeong-Hoon Lee
Min Song
232
28
0
16 Jan 2024
Fusing Echocardiography Images and Medical Records for Continuous Patient Stratification
IEEE Transactions on Ultrasonics, Ferroelectrics and Frequency Control (IEEE TUFFC), 2024
Nathan Painchaud
Jérémie Stym-Popper
P. Courand
Nicolas Thome
Pierre-Marc Jodoin
Nicolas Duchateau
Olivier Bernard
234
4
0
15 Jan 2024
Cross-modal Retrieval for Knowledge-based Visual Question Answering
European Conference on Information Retrieval (ECIR), 2024
Paul Lerner
Olivier Ferret
C. Guinaudeau
252
13
0
11 Jan 2024
Mixture of multilayer stochastic block models for multiview clustering
Kylliann De Santiago
Marie Szafranski
Christophe Ambroise
227
1
0
09 Jan 2024
Complementary Information Mutual Learning for Multimodality Medical Image Segmentation
Chuyun Shen
Wenhao Li
Haoqing Chen
Xiaoling Wang
Fengping Zhu
Yuxin Li
Xiangfeng Wang
Bo Jin
261
4
0
05 Jan 2024
Explore Human Parsing Modality for Action Recognition
CAAI Transactions on Intelligence Technology (CAAI-TIT), 2024
Jinfu Liu
Runwei Ding
Yuhang Wen
Nan Dai
Fanyang Meng
Shen Zhao
Mengyuan Liu
200
13
0
04 Jan 2024
Bayesian Unsupervised Disentanglement of Anatomy and Geometry for Deep Groupwise Image Registration
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Xinzhe Luo
Xin Wang
Linda Shapiro
Chun Yuan
Jianfeng Feng
Xiahai Zhuang
325
0
0
04 Jan 2024
XAI for In-hospital Mortality Prediction via Multimodal ICU Data
Xingqiao Li
Jindong Gu
Zhiyong Wang
Yancheng Yuan
Bo Du
Fengxiang He
181
3
0
29 Dec 2023
Blind Image Quality Assessment: A Brief Survey
Miaohui Wang
143
1
0
27 Dec 2023
Inter-X: Towards Versatile Human-Human Interaction Analysis
Liang Xu
Xintao Lv
Manwen Liao
Xin Jin
Shuwen Wu
...
Fengyun Rao
Xingdong Sheng
Yunhui Liu
Wenjun Zeng
Yunbo Wang
312
76
0
26 Dec 2023
DeepCalliFont: Few-shot Chinese Calligraphy Font Synthesis by Integrating Dual-modality Generative Models
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yitian Liu
Zhouhui Lian
GAN
VLM
169
8
0
16 Dec 2023
3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Dingning Liu
Xiaomeng Dong
Renrui Zhang
Xu Luo
Shiyang Feng
Xiaoshui Huang
Yongshun Gong
Zhihui Wang
187
18
0
15 Dec 2023
Defenses in Adversarial Machine Learning: A Survey
Baoyuan Wu
Shaokui Wei
Mingli Zhu
Meixi Zheng
Zihao Zhu
Ruotong Wang
Hongrui Chen
Danni Yuan
Li Liu
Qingshan Liu
AAML
303
22
0
13 Dec 2023
Foundation Models in Robotics: Applications, Challenges, and the Future
Roya Firoozi
Johnathan Tucker
Stephen Tian
Anirudha Majumdar
Jiankai Sun
...
Brian Ichter
Danny Driess
Jiajun Wu
Cewu Lu
Mac Schwager
LM&Ro
AI4CE
LRM
VLM
263
287
0
13 Dec 2023
Non-contact Multimodal Indoor Human Monitoring Systems: A Survey
L. Nguyen
Praneeth Susarla
Anirban Mukherjee
Manuel Lage Cañellas
Constantino Álvarez Casado
Xiaoting Wu
Olli Silvén
D. Jayagopi
Miguel Bordallo López
209
7
0
11 Dec 2023
Bootstrapping Autonomous Driving Radars with Self-Supervised Learning
Yiduo Hao
Sohrab Madani
Junfeng Guan
Mohammed Alloulah
Saurabh Gupta
Haitham Hassanieh
SSL
346
12
0
07 Dec 2023
WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words
Lukas Wolf
Greta Tuckute
Klemen Kotar
Eghbal Hosseini
Tamar I. Regev
Ethan Gotlieb Wilcox
Alex Warstadt
233
3
0
05 Dec 2023
Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data
International Conference on Behavioral, Economic, and Socio-Cultural Computing (ICBESC), 2023
David Hason Rudd
Huan Huo
Md. Rafiqul Islam
Guandong Xu
98
3
0
03 Dec 2023
Understanding Unimodal Bias in Multimodal Deep Linear Networks
International Conference on Machine Learning (ICML), 2023
Yedi Zhang
Peter E. Latham
Andrew Saxe
275
15
0
01 Dec 2023
Consensus, dissensus and synergy between clinicians and specialist foundation models in radiology report generation
Ryutaro Tanno
D. G. Barrett
Andrew Sellergren
Sumedh Ghaisas
Sumanth Dathathri
...
S. Shetty
Pushmeet Kohli
Po-Sen Huang
Alan Karthikesalingam
Ira Ktena
MedIm
259
14
0
30 Nov 2023
Improving embedding of graphs with missing data by soft manifolds
Andrea Marinoni
Pietro Lio
Alessandro Barp
Christian Jutten
Mark Girolami
250
0
0
29 Nov 2023
Visual cognition in multimodal large language models
Luca M. Schulze Buschoff
Elif Akata
Matthias Bethge
Eric Schulz
LRM
352
53
0
27 Nov 2023
Rethinking Radiology Report Generation via Causal Reasoning and Counterfactual Augmentation
ACM International Conference on Bioinformatics, Computational Biology and Biomedicine (ACM-BCB), 2023
Xiao Song
Jiafan Liu
Yun Li
Wenbin Lei
Ruxin Wang
CML
151
0
0
22 Nov 2023
DMLR: Data-centric Machine Learning Research -- Past, Present and Future
Luis Oala
M. Maskey
Lilith Bat-Leah
Alicia Parrish
Nezihe Merve Gürel
...
Lora Aroyo
Ce Zhang
Joaquin Vanschoren
Isabelle Guyon
Peter Mattson
AI4CE
270
17
0
21 Nov 2023
Modality Mixer Exploiting Complementary Information for Multi-modal Action Recognition
Sumin Lee
Sangmin Woo
Muhammad Adi Nugroho
Changick Kim
251
0
0
21 Nov 2023
Deception Detection from Linguistic and Physiological Data Streams Using Bimodal Convolutional Neural Networks
Panfeng Li
M. Abouelenien
Amélie Reymond
Zhicheng Ding
Qikai Yang
Yiming Zhou
359
88
0
18 Nov 2023
Improving Unimodal Inference with Multimodal Transformers
K. Chumachenko
Alexandros Iosifidis
Moncef Gabbouj
219
0
0
16 Nov 2023
Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023
Cristina Palmero
Mikel de Velasco
Mohamed Amine Hmani
Aymen Mtibaa
Leila Ben Letaifa
...
Anna Esposito
M. El-Yacoubi
Dijana Petrovska – Delacretaz
M. Inés Torres
Sergio Escalera
235
8
0
09 Nov 2023
Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction
Cam-Van Thi Nguyen
Anh-Tuan Mai
The-Son Le
Hai-Dang Kieu
Duc-Trong Le
362
45
0
08 Nov 2023
Transforming Agriculture with Intelligent Data Management and Insights
Yu Pan
Jianxin Sun
Hongfeng Yu
Geng Bai
Yufeng Ge
Joe Luck
Tala Awada
154
7
0
07 Nov 2023
ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations
Muntabir Hasan Choudhury
Lamia Salsabil
William A. Ingram
Edward A. Fox
Jian Wu
149
0
0
07 Nov 2023
Self-MI: Efficient Multimodal Fusion via Self-Supervised Multi-Task Learning with Auxiliary Mutual Information Maximization
Pacific Asia Conference on Language, Information and Computation (PACLIC), 2023
Cam-Van Thi Nguyen
Ngoc-Hoa Thi Nguyen
Duc-Trong Le
Quang-Thuy Ha
SSL
195
0
0
07 Nov 2023
Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects
International Journal of Computer Vision (IJCV), 2023
Elisa Warner
Joonsan Lee
William Hsu
Tanveer Syeda-Mahmood
Charles Kahn
Olivier Gevaert
Arvind Rao
LM&MA
640
36
0
04 Nov 2023
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
Information Fusion (Inf. Fusion), 2023
Md Farhan Ishmam
Md Sakib Hossain Shovon
M. F. Mridha
Nilanjan Dey
399
71
0
01 Nov 2023
SimMMDG: A Simple and Effective Framework for Multi-modal Domain Generalization
Neural Information Processing Systems (NeurIPS), 2023
Hao Dong
Ismail Nejjar
Han Sun
Eleni Chatzi
Olga Fink
302
48
0
30 Oct 2023
MM-VID: Advancing Video Understanding with GPT-4V(ision)
Kevin Qinghong Lin
Faisal Ahmed
Linjie Li
Chung-Ching Lin
E. Azarnasab
...
Lin Liang
Zicheng Liu
Yumao Lu
Ce Liu
Lijuan Wang
MLLM
232
84
0
30 Oct 2023
A Survey on Knowledge Editing of Neural Networks
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Vittorio Mazzia
Alessandro Pedrani
Andrea Caciolai
Kay Rottmann
Davide Bernardi
KELM
411
38
0
30 Oct 2023
Domain Generalization in Computational Pathology: Survey and Guidelines
ACM Computing Surveys (ACM Comput. Surv.), 2023
Mostafa Jahanifar
M. Raza
Kesi Xu
T. Vuong
R. Jewsbury
...
Neda Zamanitajeddin
Jin Tae Kwak
S. Raza
F. Minhas
Nasir M. Rajpoot
OOD
260
34
0
30 Oct 2023
MOSEL: Inference Serving Using Dynamic Modality Selection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bodun Hu
Le Xu
Jeongyoon Moon
N. Yadwadkar
Aditya Akella
300
5
0
27 Oct 2023
ArchBERT: Bi-Modal Understanding of Neural Architectures and Natural Languages
Conference on Computational Natural Language Learning (CoNLL), 2023
Mohammad Akbari
Saeed Ranjbar Alvar
Behnam Kamranian
Amin Banitalebi-Dehkordi
Yong Zhang
AI4CE
139
2
0
26 Oct 2023
Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction
ACM Multimedia (ACM MM), 2023
Xuming Hu
Junzhe Chen
Aiwei Liu
Shiao Meng
Lijie Wen
Philip S. Yu
226
25
0
25 Oct 2023
Data Optimization in Deep Learning: A Survey
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Ou Wu
Rujing Yao
323
5
0
25 Oct 2023
Graph-based multimodal multi-lesion DLBCL treatment response prediction from PET images
Oriane Thiery
M. Rizkallah
C. Bailly
C. Bodet-Milin
Emmanuel Itti
R. Casasnovas
S. Gouill
T. Carlier
D. Mateus
MedIm
AI4CE
155
2
0
25 Oct 2023
Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transformer
Neural Information Processing Systems (NeurIPS), 2023
Namkyeong Lee
Heewoong Noh
Sungwon Kim
Dongmin Hyun
Gyoung S. Na
Chanyoung Park
297
9
0
24 Oct 2023
Malicious Agent Detection for Robust Multi-Agent Collaborative Perception
Yangheng Zhao
Zhen Xiang
Sheng Yin
Xianghe Pang
Siheng Chen
Yanfeng Wang
AAML
312
10
0
18 Oct 2023
Machine Learning for Urban Air Quality Analytics: A Survey
Jindong Han
Weijiao Zhang
Hao Liu
Hui Xiong
AI4CE
274
17
0
14 Oct 2023
Previous
1
2
3
...
6
7
8
...
17
18
19
Next