Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1705.09406
Cited By
v1
v2 (latest)
Multimodal Machine Learning: A Survey and Taxonomy
26 May 2017
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multimodal Machine Learning: A Survey and Taxonomy"
50 / 941 papers shown
Confidence-aware multi-modality learning for eye disease screening
K. Zou
Tian Lin
Zongbo Han
Meng Wang
Xuedong Yuan
Haoyu Chen
Changqing Zhang
Xiaojing Shen
Huazhu Fu
166
10
0
28 May 2024
The Evolution of Multimodal Model Architectures
S. Wadekar
Abhishek Chaurasia
Vasu Sharma
Eugenio Culurciello
321
27
0
28 May 2024
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Yake Wei
Di Hu
295
60
0
28 May 2024
Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning
Zihua Zhao
Mengxi Chen
Tianjie Dai
Jiangchao Yao
Bo han
Ya Zhang
Yanfeng Wang
NoLa
207
10
0
27 May 2024
Exploring a Multimodal Fusion-based Deep Learning Network for Detecting Facial Palsy
Nicole Heng Yim Oo
Min Hun Lee
Jeong Hoon Lim
CVBM
156
4
0
26 May 2024
Towards Natural Machine Unlearning
Zhengbao He
Tao Li
Xinwen Cheng
Zhehao Huang
Xiaolin Huang
MU
343
7
0
24 May 2024
Space-aware Socioeconomic Indicator Inference with Heterogeneous Graphs
Xingchen Zou
Jiani Huang
Xixuan Hao
Yuhao Yang
Haomin Wen
Yibo Yan
Chao Huang
Chen Chao
Yuxuan Liang
216
3
0
23 May 2024
Review of deep learning models for crypto price prediction: implementation and evaluation
Jingyang Wu
Xinyi Zhang
Fangyixuan Huang
Haochen Zhou
Rohtiash Chandra
193
9
0
19 May 2024
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon Team
MLLM
572
623
0
16 May 2024
ReconBoost: Boosting Can Achieve Modality Reconcilement
International Conference on Machine Learning (ICML), 2024
Cong Hua
Qianqian Xu
Shilong Bao
Zhiyong Yang
Qingming Huang
204
38
0
15 May 2024
Alignment Helps Make the Most of Multimodal Data
Christian Arnold
Andreas Küpfer
327
2
0
14 May 2024
HoneyBee: A Scalable Modular Framework for Creating Multimodal Oncology Datasets with Foundational Embedding Models
Aakash Tripathi
Asim Waqas
M. Schabath
Yasin Yilmaz
Ghulam Rasool
547
6
0
13 May 2024
Representation Learning of Daily Movement Data Using Text Encoders
Alexander Capstick
Tianyu Cui
Yu Chen
Payam Barnaghi
AI4TS
236
2
0
07 May 2024
CVTGAD: Simplified Transformer with Cross-View Attention for Unsupervised Graph-level Anomaly Detection
Jindong Li
Qianli Xing
Zhiqiang Zhang
Yi Chang
ViT
170
16
0
03 May 2024
Empowering Time Series Analysis with Foundation Models: A Comprehensive Survey
Weiqi Zhang
Yongzi Yu
Ke Yi
Yongzi Yu
Ziyue Li
Jia Li
AI4TS
AI4CE
442
34
0
03 May 2024
Language-Enhanced Latent Representations for Out-of-Distribution Detection in Autonomous Driving
Zhenjiang Mao
Dong-You Jhong
Ao Wang
Ivan Ruchkin
OODD
246
2
0
02 May 2024
MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection
H. R. Medeiros
David Latortue
Fidel Alejandro Guerrero Peña
Eric Granger
M. Pedersoli
191
0
0
29 Apr 2024
M3H: Multimodal Multitask Machine Learning for Healthcare
Dimitris Bertsimas
Yu Ma
196
6
0
29 Apr 2024
Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Qingyang Zhang
Yake Wei
Zongbo Han
Huazhu Fu
Xi Peng
...
Qinghua Hu
Cai Xu
Jie Wen
Di Hu
Changqing Zhang
308
61
0
27 Apr 2024
AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models
Zhiqiang Tang
Haoyang Fang
Su Zhou
Taojiannan Yang
Zihan Zhong
Tony Hu
Katrin Kirchhoff
George Karypis
309
30
0
24 Apr 2024
A review of deep learning-based information fusion techniques for multimodal medical image classification
Yi-Hsuan Li
Mostafa EL HABIB DAHO
Pierre-Henri Conze
Rachid Zeghlache
Hugo Le Boité
R. Tadayoni
B. Cochener
M. Lamard
G. Quellec
172
112
0
23 Apr 2024
Leveraging Speech for Gesture Detection in Multimodal Communication
E. Ghaleb
I. Burenko
Marlou Rasenberg
Wim Pouw
Ivan Toni
Peter Uhrig
Anna Wilson
Judith Holler
Asli Ozyurek
Raquel Fernández
SLR
185
5
0
23 Apr 2024
Machine Learning Techniques for MRI Data Processing at Expanding Scale
Taro Langner
182
0
0
22 Apr 2024
Cooperative Sentiment Agents for Multimodal Sentiment Analysis
Shan Wang
Hui Shuai
Qingshan Liu
Fei Wang
LLMAG
271
4
0
19 Apr 2024
Dynamic Modality and View Selection for Multimodal Emotion Recognition with Missing Modalities
Luciana Trinkaus Menon
Luiz Carlos Ribeiro Neduziak
J. P. Barddal
A. L. Koerich
A. Britto
180
0
0
18 Apr 2024
MK-SGN: A Spiking Graph Convolutional Network with Multimodal Fusion and Knowledge Distillation for Skeleton-based Action Recognition
Naichuan Zheng
Hailun Xia
Zeyu Liang
Yuchen Du
351
4
0
16 Apr 2024
I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey
Noah Lewis
J. L. Bez
Suren Byna
475
4
0
16 Apr 2024
Learning to Rebalance Multi-Modal Optimization by Adaptively Masking Subnetworks
Yang Yang
Hongpeng Pan
Qingjun Jiang
Yi Tian Xu
Jinghui Tang
186
19
0
12 Apr 2024
Progressive Alignment with VLM-LLM Feature to Augment Defect Classification for the ASE Dataset
Chih-Chung Hsu
Chia-Ming Lee
Chun-Hung Sun
Kuang-Ming Wu
153
0
0
08 Apr 2024
A Data-to-Product Multimodal Conceptual Framework to Achieve Automated Software Evolution for Context-rich Intelligent Applications
Songhui Yue
198
3
0
07 Apr 2024
Continual Learning for Smart City: A Survey
Li Yang
Zhipeng Luo
Shi-sheng Zhang
Fei Teng
Tian-Jie Li
HAI
260
16
0
01 Apr 2024
Envisioning MedCLIP: A Deep Dive into Explainability for Medical Vision-Language Models
Anees Ur Rehman Hashmi
Dwarikanath Mahapatra
Mohammad Yaqub
VLM
MedIm
128
4
0
27 Mar 2024
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Wenqiao Zhang
Tianwei Lin
Jiang Liu
Fangxun Shu
Haoyuan Li
...
Zheqi Lv
Hao Jiang
Juncheng Li
Siliang Tang
Yueting Zhuang
VLM
MLLM
235
16
0
20 Mar 2024
Language Evolution with Deep Learning
Mathieu Rita
Paul Michel
Rahma Chaabouni
Olivier Pietquin
Emmanuel Dupoux
Florian Strub
201
3
0
18 Mar 2024
Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity
Zhuo Zhi
Ziquan Liu
M. Elbadawi
Adam Daneshmend
Mine Orlu
Abdul Basit
Andreas Demosthenous
Miguel R. D. Rodrigues
276
4
0
14 Mar 2024
S^2MVTC: a Simple yet Efficient Scalable Multi-View Tensor Clustering
Computer Vision and Pattern Recognition (CVPR), 2024
Zhen Long
Qiyuan Wang
Yazhou Ren
Yipeng Liu
Ce Zhu
278
11
0
14 Mar 2024
A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product
Ao Xiang
Zongqing Qi
Han Wang
Qin Yang
Danqing Ma
160
34
0
13 Mar 2024
Credibility-Aware Multi-Modal Fusion Using Probabilistic Circuits
Sahil Sidheekh
Pranuthi Tenali
Saurabh Mathur
Erik Blasch
Kristian Kersting
S. Natarajan
228
4
0
05 Mar 2024
TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification
Tong Zheng
Shusaku Sone
Yoshitaka Ushiku
Yuki Oba
Jiaxin Ma
391
1
0
04 Mar 2024
Deep Learning for Cross-Domain Data Fusion in Urban Computing: Taxonomy, Advances, and Outlook
Xingchen Zou
Yibo Yan
Xixuan Hao
Yuehong Hu
Haomin Wen
...
Junbo Zhang
Yong Li
Tianrui Li
Yu Zheng
Yuxuan Liang
HAI
AI4TS
323
82
0
29 Feb 2024
Learning Invariant Inter-pixel Correlations for Superpixel Generation
Sen Xu
Shikui Wei
Tao Ruan
Lixin Liao
SupR
197
10
0
28 Feb 2024
Demonstrating and Reducing Shortcuts in Vision-Language Representation Learning
Maurits J. R. Bleeker
Mariya Hendriksen
Andrew Yates
Maarten de Rijke
VLM
319
9
0
27 Feb 2024
An Empirical Evaluation of Neural and Neuro-symbolic Approaches to Real-time Multimodal Complex Event Detection
Liying Han
Mani Srivastava
107
3
0
17 Feb 2024
Quantifying and Enhancing Multi-modal Robustness with Modality Preference
Zequn Yang
Yake Wei
Ce Liang
Di Hu
AAML
324
22
0
09 Feb 2024
A Survey on Safe Multi-Modal Learning System
Tianyi Zhao
Liangliang Zhang
Yao Ma
Lu Cheng
511
21
0
08 Feb 2024
Designing deep neural networks for driver intention recognition
Koen Vellenga
H. Steinhauer
Alexander Karlsson
Göran Falkman
A. Rhodin
A. Koppisetty
174
3
0
07 Feb 2024
The Future of Cognitive Strategy-enhanced Persuasive Dialogue Agents: New Perspectives and Trends
Mengqi Chen
Bin Guo
Hao Wang
Haoyu Li
Qian Zhao
Jingqi Liu
Yasan Ding
Yan Pan
Zhiwen Yu
LLMAG
235
4
0
07 Feb 2024
Review of multimodal machine learning approaches in healthcare
"Felix H. Krones
Umar Marikkar
Guy Parsons
Adam Szmul
Adam Mahdi
364
89
0
04 Feb 2024
Generative Visual Compression: A Review
Bo Chen
Shanzhi Yin
Peilin Chen
Shiqi Wang
Yan Ye
200
15
0
03 Feb 2024
Multi-Modal Machine Learning Framework for Automated Seizure Detection in Laboratory Rats
Aaron D. Mullen
Samuel E. Armstrong
Jasmine Perdeh
Bjorn Bauer
Jeff Talbert
V. Bumgardner
62
2
0
01 Feb 2024
Previous
1
2
3
...
5
6
7
...
17
18
19
Next