v1v2 (latest)

Multimodal Machine Learning: A Survey and Taxonomy

26 May 2017

T. Baltrušaitis

Chaitanya Ahuja

Louis-Philippe Morency

ArXiv (abs)PDF HTML

Papers citing "Multimodal Machine Learning: A Survey and Taxonomy"

50 / 941 papers shown

General Greedy De-bias LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

480

20 Dec 2021

Dual-Key Multimodal Backdoors for Visual Question Answering

161

14 Dec 2021

Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking

Yidi Li

Hong Liu

Hao Tang

241

14 Dec 2021

Data Collection and Quality Challenges in Deep Learning: A Data-Centric AI Perspective

452

463

13 Dec 2021

A graph representation based on fluid diffusion model for data analysis: theoretical aspects and enhanced community detection

Andrea Marinoni

Christian Jutten

Mark Girolami

361

07 Dec 2021

Contrastive Cycle Adversarial Autoencoders for Single-cell Multi-omics Alignment and IntegrationbioRxiv (bioRxiv), 2021

226

05 Dec 2021

Active Sensing for Search and Tracking: A Review

Luca Varotto

Angelo Cenedese

Andrea Cavallaro

162

04 Dec 2021

Channel Exchanging Networks for Multimodal and Multitask Dense Image PredictionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

287

04 Dec 2021

Shapes of Emotions: Multimodal Emotion Recognition in Conversations via Emotion Shifts

185

03 Dec 2021

Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval

218

03 Dec 2021

ContIG: Self-supervised Multimodal Contrastive Learning for Medical Imaging with GeneticsComputer Vision and Pattern Recognition (CVPR), 2021

615

26 Nov 2021

Geometric Multimodal Deep Learning with Multi-Scaled Graph Wavelet Convolutional NetworkIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021

M. Behmanesh

Peyman Adibi

M. Ehsani

Jocelyn Chanussot

213

26 Nov 2021

Sparse Fusion for Multimodal Transformers

169

23 Nov 2021

GN-Transformer: Fusing Sequence and Graph Representation for Improved Code Summarization

Junyan Cheng

Iordanis Fostiropoulos

Barry W. Boehm

123

17 Nov 2021

TorchGeo: Deep Learning With Geospatial Data

Adam J. Stewart

Arindam Banerjee

375

105

17 Nov 2021

Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma DistributionsNeural Information Processing Systems (NeurIPS), 2021

258

11 Nov 2021

A framework for comprehensible multi-modal detection of cyber threats

107

10 Nov 2021

Social Fraud Detection Review: Methods, Challenges and Analysis

Wei Liu

220

10 Nov 2021

Cross Attentional Audio-Visual Fusion for Dimensional Emotion RecognitionIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2021

208

09 Nov 2021

How does a Pre-Trained Transformer Integrate Contextual Keywords? Application to Humanitarian Computing

Valentin Barrière

Guillaume Jacquet

07 Nov 2021

ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving VehicleInternational Conference on Multimodal Interaction (ICMI), 2021

Amr Gomaa

Guillermo Reyes

Michael Feld

03 Nov 2021

A Comparative Study of Speaker Role Identification in Air Traffic Communication Using Deep Learning Approaches

284

03 Nov 2021

A Survey on Epistemic (Model) Uncertainty in Supervised Learning: Recent Advances and Applications

321

03 Nov 2021

Latent Structure Mining with Contrastive Modality Fusion for Multimedia RecommendationIEEE Transactions on Knowledge and Data Engineering (TKDE), 2021

Liang Wang

278

01 Nov 2021

Revisit Multimodal Meta-Learning through the Lens of Multi-Task Learning

Milad Abdollahzadeh

Touba Malekzadeh

Ngai-Man Cheung

153

27 Oct 2021

Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image CaptioningIEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2021

Yang Yang

Haoran Wei

Hengshu Zhu

Dianhai Yu

Hui Xiong

Jian Yang

SSL

107

22 Oct 2021

Deep multi-modal aggregation network for MR image reconstruction with auxiliary modality

225

15 Oct 2021

From Multimodal to Unimodal Attention in Transformers using Knowledge Distillation

141

15 Oct 2021

StreaMulT: Streaming Multimodal Transformer for Heterogeneous and Arbitrary Long Sequential Data

197

15 Oct 2021

DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning

Yizhi Wang

Zheng Lian

3DV

290

13 Oct 2021

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training ParadigmInternational Conference on Learning Representations (ICLR), 2021

Wanli Ouyang

424

540

11 Oct 2021

Embed Everything: A Method for Efficiently Co-Embedding Multi-Modal Spaces

Sarah Di

Robin Yu

Amol Kapoor

09 Oct 2021

On the Limitations of Multimodal VAEsInternational Conference on Learning Representations (ICLR), 2021

295

08 Oct 2021

3D-MOV: Audio-Visual LSTM Autoencoder for 3D Reconstruction of Multiple Objects from Video

Justin Wilson

Ming-Chia Lin

119

05 Oct 2021

Deep Neural Networks and Tabular Data: A Survey

Gjergji Kasneci

542

977

05 Oct 2021

Neural Dependency Coding inspired Multimodal Fusion

Shiv Shankar

256

28 Sep 2021

Multimodality in Meta-Learning: A Comprehensive Survey

Irwin King

262

28 Sep 2021

UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation

Zhengkun Zhang

Xiaojun Meng

Yasheng Wang

Xin Jiang

Qun Liu

Zhenglu Yang

174

13 Sep 2021

TEASEL: A Transformer-Based Speech-Prefixed Language Model

Mehdi Arjmand

M. Dousti

H. Moradi

144

12 Sep 2021

A Survey on Multi-modal Summarization

206

11 Sep 2021

Multimodal Federated Learning on IoT DataInternational Conference on Internet-of-Things Design and Implementation (IoTDI), 2021

Yuchen Zhao

Payam Barnaghi

Hamed Haddadi

146

100

10 Sep 2021

Retrieve, Caption, Generate: Visual Grounding for Enhancing Commonsense in Text Generation ModelsAAAI Conference on Artificial Intelligence (AAAI), 2021

226

08 Sep 2021

Hybrid Contrastive Learning of Tri-Modal Representation for Multimodal Sentiment Analysis

Sijie Mai

Ying Zeng

Shuangjia Zheng

Haifeng Hu

163

187

04 Sep 2021

Improving Multimodal fusion via Mutual Dependency MaximisationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

296

31 Aug 2021

Vision-Language Navigation: A Survey and Taxonomy

333

26 Aug 2021

Maximum Likelihood Estimation for Multimodal Learning with Missing Modality

215

24 Aug 2021

Detection of Illicit Drug Trafficking Events on Instagram: A Deep Multimodal Multilabel Learning Approach

Chuanbo Hu

Minglei Yin

Bin Liu

Xin Li

Yanfang Ye

107

19 Aug 2021

Emotion Recognition from Multiple Modalities: Fundamentals and Methodologies

221

135

18 Aug 2021

Affect-Aware Deep Belief Network Representations for Multimodal Unsupervised Deception Detection

Leena Mathur

Maja J. Matarić

CVBM

137

17 Aug 2021

Interpretable Visual Understanding with Cognitive Attention NetworkInternational Conference on Artificial Neural Networks (ICANN), 2021

Wenbin Zhang

281

06 Aug 2021