1705.09406
Multimodal Machine Learning: A Survey and Taxonomy
26 May 2017
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
Papers citing
"Multimodal Machine Learning: A Survey and Taxonomy"
50 / 941 papers shown
Introducing Representations of Facial Affect in Automated Multimodal Deception Detection
International Conference on Multimodal Interaction (ICMI), 2020
Leena Mathur
Maja J. Matarić
CVBM
195
42
0
31 Aug 2020
Collaborative Multi-Robot Systems for Search and Rescue: Coordination and Perception
Jorge Peña Queralta
Jussi Taipalmaa
Bilge Can Pullinen
V. Sarker
Tuan Anh Nguyen Gia
H. Tenhunen
Moncef Gabbouj
Jenni Raitoharju
Tomi Westerlund
139
32
0
28 Aug 2020
Training Multimodal Systems for Classification with Multiple Objectives
Jason Armitage
Shramana Thakur
Rishi Tripathi
Jens Lehmann
M. Maleshkova
160
1
0
26 Aug 2020
A Baseline Analysis for Podcast Abstractive Summarization
Chujie Zheng
Harry J. Wang
Kunpeng Zhang
Ling Fan
181
13
0
24 Aug 2020
A Efficient Multimodal Framework for Large Scale Emotion Recognition by Fusing Music and Electrodermal Activity Signals
Guanghao Yin
Shouqian Sun
Dian Yu
Dejian Li
Ke-jun Zhang
121
34
0
22 Aug 2020
A Survey of Visual Analytics Techniques for Machine Learning
Jun Yuan
Changjian Chen
Weikai Yang
Mengchen Liu
Jiazhi Xia
Shixia Liu
286
255
0
21 Aug 2020
Linguistically-aware Attention for Reducing the Semantic-Gap in Vision-Language Tasks
K. Gouthaman
Athira M. Nambiar
K. Srinivas
Anurag Mittal
VLM
256
14
0
18 Aug 2020
Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention
Bin Duan
Hao Tang
Wei Wang
Ziliang Zong
Guowei Yang
Yan Yan
156
72
0
14 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
164
87
0
11 Aug 2020
Auto-weighting for Breast Cancer Classification in Multimodal Ultrasound
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2020
Jian Wang
Juzheng Miao
Yang Xin
Rui Li
Guangquan Zhou
...
Wufeng Xue
X. Jia
Jianqiao Zhou
Ruobing Huang
Dong Ni
133
27
0
08 Aug 2020
HAMLET: A Hierarchical Multimodal Attention-based Human Activity Recognition Algorithm
Md. Mofijul Islam
Tariq Iqbal
158
94
0
03 Aug 2020
Characterizing Communities of Hashtag Usage on Twitter During the 2020 COVID-19 Pandemic by Multi-view Clustering
Iain J. Cruickshank
Kathleen M. Carley
163
44
0
03 Aug 2020
AiRound and CV-BrCT: Novel Multi-View Datasets for Scene Classification
Gabriel L. S. Machado
E. Ferreira
Keiller Nogueira
Hugo Oliveira
P. H. T. Gama
J. D. Santos
98
28
0
03 Aug 2020
From Robotic Process Automation to Intelligent Process Automation: Emerging Trends
International Conference on Business Process Management (BPM), 2020
Tathagata Chakraborti
Vatche Isahagian
Rania Y. Khalaf
Y. Khazaeni
Vinod Muthusamy
Sadhana Kumaravel
Merve Unuvar
AI4CE
154
53
0
27 Jul 2020
Federated Self-Supervised Learning of Multi-Sensor Representations for Embedded Intelligence
IEEE Internet of Things Journal (IEEE IoT J.), 2020
Aaqib Saeed
Flora D. Salim
T. Ozcelebi
J. Lukkien
FedML
SSL
233
117
0
25 Jul 2020
Deep Learning Techniques for Future Intelligent Cross-Media Retrieval
S. Rehman
M. Waqas
Shanshan Tu
Anis Koubaa
O. Rehman
Jawad Ahmad
Muhammad Hanif
Zhu Han
150
8
0
21 Jul 2020
Audio-Visual Understanding of Passenger Intents for In-Cabin Conversational Agents
Eda Okur
Shachi H. Kumar
Saurav Sahay
L. Nachman
143
9
0
08 Jul 2020
MAMO: Memory-Augmented Meta-Optimization for Cold-start Recommendation
Manqing Dong
Feng Yuan
Lina Yao
Xiwei Xu
Liming Zhu
CLL
142
184
0
07 Jul 2020
Jointly Modeling Motion and Appearance Cues for Robust RGB-T Tracking
Pengyu Zhang
Jie Zhao
Dong Wang
Huchuan Lu
Xiaoyun Yang
167
176
0
04 Jul 2020
Deep Feature Space: A Geometrical Perspective
Ioannis Kansizoglou
Loukas Bampis
Antonios Gasteratos
307
46
0
30 Jun 2020
BERTERS: Multimodal Representation Learning for Expert Recommendation System with Transformer
Narjes Nikzad Khasmakhi
M. Balafar
M. Feizi-Derakhshi
C. Motamed
OffRL
94
36
0
30 Jun 2020
X-ModalNet: A Semi-Supervised Deep Cross-Modal Network for Classification of Remote Sensing Data
ISPRS Journal of Photogrammetry and Remote Sensing (ISPRS J. Photogramm. Remote Sens.), 2020
Danfeng Hong
Xiangwei Zhu
Gui-Song Xia
J. Chanussot
X. Zhu
158
69
0
24 Jun 2020
Multimodal Generative Learning Utilizing Jensen-Shannon-Divergence
Thomas M. Sutter
Imant Daunhawer
Julia E. Vogt
294
89
0
15 Jun 2020
Towards Robust Pattern Recognition: A Review
Proceedings of the IEEE (Proc. IEEE), 2020
Xu-Yao Zhang
Cheng-Lin Liu
C. Suen
OOD
HAI
203
126
0
12 Jun 2020
Interpretable, similarity-driven multi-view embeddings from high-dimensional biomedical data
Brian B. Avants
Nicholas J. Tustison
J. Stone
171
18
0
11 Jun 2020
Report from the NSF Future Directions Workshop, Toward User-Oriented Agents: Research Directions and Challenges
M. Eskénazi
Tiancheng Zhao
LLMAG
AI4TS
AI4CE
222
9
0
10 Jun 2020
mEBAL: A Multimodal Database for Eye Blink Detection and Attention Level Estimation
Roberto Daza
Aythami Morales
Julian Fierrez
Ruben Tolosana
CVBM
180
46
0
09 Jun 2020
Hysia: Serving DNN-Based Video-to-Retail Applications in Cloud
Huaizheng Zhang
Yuanming Li
Qiming Ai
Yong Luo
Yonggang Wen
Yichao Jin
T. Duong
3DH
114
11
0
09 Jun 2020
Large Scale Audiovisual Learning of Sounds with Weakly Labeled Data
International Joint Conference on Artificial Intelligence (IJCAI), 2020
Haytham M. Fayek
Anurag Kumar
205
37
0
29 May 2020
Learning Tversky Similarity
International Conference on Information Processing and Management of Uncertainty (IPMU), 2020
J. Rahnama
Eyke Hüllermeier
116
6
0
27 May 2020
Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition
George Sterpu
Christian Saam
N. Harte
114
7
0
19 May 2020
Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition
Di Hu
Xuhong Li
Lichao Mou
P. Jin
Dong Chen
L. Jing
Xiaoxiang Zhu
Dejing Dou
166
6
0
18 May 2020
COBRA: Contrastive Bi-Modal Representation Algorithm
Vishaal Udandarao
A. Maiti
Deepak Srivatsav
Suryatej Reddy Vyalla
Yifang Yin
R. Shah
221
28
0
07 May 2020
Designing Accurate Emulators for Scientific Processes using Calibration-Driven Deep Models
Nature Communications (Nat Commun), 2020
Jayaraman J. Thiagarajan
Bindya Venkatesh
Rushil Anirudh
P. Bremer
J. Gaffney
G. Anderson
B. Spears
220
24
0
05 May 2020
MultiQT: Multimodal Learning for Real-Time Question Tracking in Speech
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Jakob Drachmann Havtorn
Jan Latko
Joakim Edin
Lasse Borgholt
Lars Maaløe
Lorenzo Belgrano
Nicolai Frost Jakobsen
R. Sdun
Zeljko Agic
135
3
0
02 May 2020
Multi-View Self-Attention for Interpretable Drug-Target Interaction Prediction
Journal of Biomedical Informatics (JBI), 2020
Brighter Agyemang
Wei-Ping Wu
Michael Y. Kpiebaareh
Zhihua Lei
Ebenezer Nanor
Lei Chen
140
31
0
01 May 2020
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2020
Zarana Parekh
Jason Baldridge
Daniel Cer
Austin Waters
Yinfei Yang
274
68
0
30 Apr 2020
Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis
Yifan Hao
Martin Q. Ma
Muqiao Yang
Ruslan Salakhutdinov
Louis-Philippe Morency
143
4
0
29 Apr 2020
Skeleton Focused Human Activity Recognition in RGB Video
Bruce X. B. Yu
Yan Liu
Keith C. C. Chan
202
5
0
29 Apr 2020
Computation on Sparse Neural Networks: an Inspiration for Future Hardware
Fei Sun
Minghai Qin
Tianyun Zhang
Liu Liu
Yen-kuang Chen
Yuan Xie
314
7
0
24 Apr 2020
How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
George Sterpu
Christian Saam
N. Harte
191
32
0
17 Apr 2020
Bias in Multimodal AI: Testbed for Fair Automatic Recruitment
Alejandro Peña
Ignacio Serna
Aythami Morales
Julian Fierrez
159
64
0
15 Apr 2020
Brain-inspired self-organization with cellular neuromorphic computing for multimodal unsupervised learning
Lyes Khacef
Laurent Rodriguez
Benoit Miramond
217
17
0
11 Apr 2020
Conditioned Source Separation for Music Instrument Performances
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Olga Slizovskaia
G. Haro
E. Gómez
244
43
0
08 Apr 2020
Predicting the Popularity of Micro-videos with Multimodal Variational Encoder-Decoder Framework
IEEE Transactions on Multimedia (TMM), 2020
Yaochen Zhu
Jiayi Xie
Zhenzhong Chen
97
33
0
28 Mar 2020
End-to-End Entity Classification on Multimodal Knowledge Graphs
W. X. Wilcke
Peter Bloem
Victor de Boer
R. V. Veer
F. V. Harmelen
152
27
0
25 Mar 2020
Emotions Don't Lie: An Audio-Visual Deepfake Detection Method Using Affective Cues
ACM Multimedia (ACM MM), 2020
Trisha Mittal
Uttaran Bhattacharya
Rohan Chandra
Aniket Bera
Tianyi Zhou
415
306
0
14 Mar 2020
Adversarial Multimodal Representation Learning for Click-Through Rate Prediction
The Web Conference (WWW), 2020
Xiang Li
Chao Wang
Jiwei Tan
Xiaoyi Zeng
Dan Ou
Bo Zheng
123
59
0
07 Mar 2020
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning
AAAI Conference on Artificial Intelligence (AAAI), 2020
Elad Amrani
Rami Ben-Ari
Daniel Rotman
A. Bronstein
327
130
0
06 Mar 2020
ASMD: an automatic framework for compiling multimodal datasets with audio and scores
Federico Simonetta
Stavros Ntalampiras
F. Avanzini
190
7
0
04 Mar 2020