Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1705.09406
Cited By
v1
v2 (latest)
Multimodal Machine Learning: A Survey and Taxonomy
26 May 2017
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multimodal Machine Learning: A Survey and Taxonomy"
50 / 941 papers shown
MultiSubs: A Large-scale Multimodal and Multilingual Dataset
International Conference on Language Resources and Evaluation (LREC), 2021
Josiah Wang
Pranava Madhyastha
J. Figueiredo
Chiraag Lala
Lucia Specia
VGen
193
13
0
02 Mar 2021
Investigations on Audiovisual Emotion Recognition in Noisy Conditions
Spoken Language Technology Workshop (SLT), 2021
Michael Neumann
Ngoc Thang Vu
146
10
0
02 Mar 2021
Variational Selective Autoencoder: Learning from Partially-Observed Heterogeneous Data
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Yu Gong
Hossein Hajimirsadeghi
Jiawei He
Thibaut Durand
Greg Mori
CML
109
11
0
25 Feb 2021
Probing Multimodal Embeddings for Linguistic Properties: the Visual-Semantic Case
International Conference on Computational Linguistics (COLING), 2021
Adam Dahlgren Lindström
Suna Bensch
Johanna Björklund
F. Drewes
159
25
0
22 Feb 2021
Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction
IEEE Robotics and Automation Letters (RA-L), 2021
Daniel Gehrig
Michelle Rüegg
Mathias Gehrig
Javier Hidalgo-Carrió
Davide Scaramuzza
188
150
0
18 Feb 2021
Learning Modality-Specific Representations with Self-Supervised Multi-Task Learning for Multimodal Sentiment Analysis
AAAI Conference on Artificial Intelligence (AAAI), 2021
Wenmeng Yu
Hua Xu
Ziqi Yuan
Jiele Wu
SSL
230
639
0
09 Feb 2021
Unsupervised Audio-Visual Subspace Alignment for High-Stakes Deception Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Leena Mathur
Maja J. Matarić
158
18
0
06 Feb 2021
MUFASA: Multimodal Fusion Architecture Search for Electronic Health Records
AAAI Conference on Artificial Intelligence (AAAI), 2021
Zhen Xu
David R. So
Andrew M. Dai
Mamba
338
65
0
03 Feb 2021
A Case Study of Deep Learning Based Multi-Modal Methods for Predicting the Age-Suitability Rating of Movie Trailers
Mahsa Shafaei
C. Smailis
I. Kakadiaris
Thamar Solorio
798
3
0
26 Jan 2021
Narration Generation for Cartoon Videos
Nikos Papasarantopoulos
Shay B. Cohen
VGen
202
2
0
17 Jan 2021
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements
IEEE Transactions on Affective Computing (TAC), 2021
Lukas Stappen
Alice Baird
Lea Schumann
Björn Schuller
232
71
0
15 Jan 2021
Trace Ratio Optimization with an Application to Multi-view Learning
Mathematical programming (Math. Program.), 2021
Li Wang
Lei-Hong Zhang
Ren-Cang Li
92
11
0
12 Jan 2021
On-Device Document Classification using multimodal features
Sugam Garg
SS Harichandana
Sumit Kumar
78
3
0
06 Jan 2021
P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding
Yunze Liu
Li Yi
Shanghang Zhang
Qingnan Fan
Thomas Funkhouser
Hao Dong
SSL
258
62
0
24 Dec 2020
Human Action Recognition from Various Data Modalities: A Review
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
MU
582
699
0
22 Dec 2020
Forecasting Irreversible Disease via Progression Learning
Botong Wu
Sijie Ren
Jing Li
Xinwei Sun
Shiming Li
Yizhou Wang
MedIm
74
0
0
21 Dec 2020
Where, What, Whether: Multi-modal Learning Meets Pedestrian Detection
Computer Vision and Pattern Recognition (CVPR), 2020
Yan Luo
Chongyang Zhang
Muming Zhao
Hao Zhou
Jun Sun
197
28
0
20 Dec 2020
Trying Bilinear Pooling in Video-QA
T. Winterbottom
S. Xiao
A. McLean
Noura Al Moubayed
207
4
0
18 Dec 2020
On Modality Bias in the TVQA Dataset
British Machine Vision Conference (BMVC), 2020
T. Winterbottom
S. Xiao
A. McLean
Noura Al Moubayed
174
44
0
18 Dec 2020
MSAF: Multimodal Split Attention Fusion
Lang Su
Chuqing Hu
Guofa Li
Dongpu Cao
279
55
0
13 Dec 2020
AffectON: Incorporating Affect Into Dialog Generation
IEEE Transactions on Affective Computing (TAC), 2020
Zana Buçinca
Y. Yemez
E. Erzin
T. Metin Sezgin
145
4
0
12 Dec 2020
Multi-modal Visual Tracking: Review and Experimental Comparison
Pengyu Zhang
Dong Wang
Huchuan Lu
260
39
0
08 Dec 2020
Parameter Efficient Multimodal Transformers for Video Representation Learning
Sangho Lee
Youngjae Yu
Gunhee Kim
Thomas Breuel
Jan Kautz
Yale Song
ViT
272
89
0
08 Dec 2020
Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment
ACM Multimedia (ACM MM), 2020
Paul Pu Liang
Peter Wu
Liu Ziyin
Louis-Philippe Morency
Ruslan Salakhutdinov
161
36
0
04 Dec 2020
Multimodal Privacy-preserving Mood Prediction from Mobile Data: A Preliminary Study
Terrance Liu
Paul Pu Liang
Michal Muszynski
Ryo Ishii
David Brent
Randy P. Auerbach
Nicholas B. Allen
Louis-Philippe Morency
137
8
0
04 Dec 2020
Detect, Reject, Correct: Crossmodal Compensation of Corrupted Sensors
IEEE International Conference on Robotics and Automation (ICRA), 2020
Michelle A. Lee
Matthew Tan
Yuke Zhu
Jeannette Bohg
226
25
0
01 Dec 2020
Analyzing Unaligned Multimodal Sequence via Graph Convolution and Graph Pooling Fusion
Sijie Mai
Songlong Xing
Jiaxuan He
Ying Zeng
Haifeng Hu
GNN
234
24
0
27 Nov 2020
Reflective-Net: Learning from Explanations
Data mining and knowledge discovery (DMKD), 2020
Johannes Schneider
Michalis Vlachos
FAtt
OffRL
LRM
386
20
0
27 Nov 2020
Uncorrelated Semi-paired Subspace Learning
Li Wang
Lei-Hong Zhang
Chungen Shen
Ren-Cang Li
SSL
84
0
0
22 Nov 2020
Hierachical Delta-Attention Method for Multimodal Fusion
Kunjal Panchal
190
1
0
22 Nov 2020
Contextual Fusion For Adversarial Robustness
Aiswarya Akumalla
S. Haney
M. Bazhenov
AAML
102
2
0
18 Nov 2020
On the Benefits of Early Fusion in Multimodal Representation Learning
George M. Barnum
Sabera Talukder
Yisong Yue
96
60
0
14 Nov 2020
Deep Partial Multi-View Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Changqing Zhang
Yajie Cui
Zongbo Han
Qiufeng Wang
Huazhu Fu
Q. Hu
222
285
0
12 Nov 2020
Deep Multimodal Fusion by Channel Exchanging
Yikai Wang
Wenbing Huang
Gang Hua
Qifeng Bai
Yu Rong
Junzhou Huang
311
281
0
10 Nov 2020
Robust Latent Representations via Cross-Modal Translation and Alignment
Vandana Rajan
Alessio Brutti
Andrea Cavallaro
214
13
0
03 Nov 2020
Personalized Multimodal Feedback Generation in Education
International Conference on Computational Linguistics (COLING), 2020
Haochen Liu
Zitao Liu
Zhongqin Wu
Shucheng Zhou
132
14
0
31 Oct 2020
Multimodal Sensor Fusion with Differentiable Filters
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
Michelle A. Lee
Brent Yi
Roberto Martín-Martín
Silvio Savarese
Jeannette Bohg
154
68
0
25 Oct 2020
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
277
6
0
19 Oct 2020
Deep-HOSeq: Deep Higher Order Sequence Fusion for Multimodal Sentiment Analysis
Industrial Conference on Data Mining (IDM), 2020
Sunny Verma
Jiwei Wang
Zhefeng Ge
Rujia Shen
Fan Jin
Yang Wang
Fang Chen
Wei Liu
105
26
0
16 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Neurocomputing (Neurocomputing), 2020
Wei Chen
Weiping Wang
Tianpeng Liu
M. Lew
VLM
329
36
0
16 Oct 2020
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation
Rohan Kumar Das
Ruijie Tao
Jichen Yang
Wei Rao
Cheng Yu
Haizhou Li
120
11
0
08 Oct 2020
Studying Person-Specific Pointing and Gaze Behavior for Multimodal Referencing of Outside Objects from a Moving Vehicle
International Conference on Multimodal Interaction (ICMI), 2020
Amr Gomaa
Guillermo Reyes
Alexandra Alles
L. Rupp
Michael Feld
91
24
0
23 Sep 2020
The Use of AI for Thermal Emotion Recognition: A Review of Problems and Limitations in Standard Design and Data
Catherine Ordun
Edward Raff
S. Purushotham
121
17
0
22 Sep 2020
Modality-Transferable Emotion Embeddings for Low-Resource Multimodal Emotion Recognition
Wenliang Dai
Zihan Liu
Tiezheng Yu
Pascale Fung
328
41
0
21 Sep 2020
A Multimodal Memes Classification: A Survey and Open Research Issues
Tariq Habib Afridi
A. Alam
Muhammad Numan Khan
Jawad Khan
Young-Koo Lee
210
43
0
17 Sep 2020
Themes Informed Audio-visual Correspondence Learning
Runze Su
Fei Tao
Xudong Liu
Haoran Wei
Xiaorong Mei
Z. Duan
Lei Yuan
Ji Liu
Yuying Xie
195
6
0
14 Sep 2020
FairCVtest Demo: Understanding Bias in Multimodal Learning with a Testbed in Fair Automatic Recruitment
International Conference on Multimodal Interaction (ICMI), 2020
Alejandro Peña
Ignacio Serna
Aythami Morales
Julian Fierrez
FaML
105
13
0
12 Sep 2020
Multi-Task Learning with Deep Neural Networks: A Survey
M. Crawshaw
CVBM
445
722
0
10 Sep 2020
Multimodal Deep Learning for Flaw Detection in Software Programs
S. Heidbrink
Kathryn N. Rodhouse
Daniel M. Dunlavy
125
9
0
09 Sep 2020
Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity
ACM Transactions on Graphics (TOG), 2020
Youngwoo Yoon
Bok Cha
Joo-Haeng Lee
Minsu Jang
Jaeyeon Lee
Jaehong Kim
Geehyuk Lee
316
339
0
04 Sep 2020
Previous
1
2
3
...
15
16
17
18
19
Next