Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2301.04856
Cited By
Multimodal Deep Learning
International Conference on Machine Learning (ICML), 2011
12 January 2023
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
Mingyu Kim
Christopher Marquardt
Marco Moldovan
Nadja Sauter
Juhan Nam
Rickmer Schulte
Karol Urbanczyk
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Yi Men
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multimodal Deep Learning"
50 / 844 papers shown
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2020
Zarana Parekh
Jason Baldridge
Daniel Cer
Austin Waters
Yinfei Yang
288
68
0
30 Apr 2020
Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis
Yifan Hao
Martin Q. Ma
Muqiao Yang
Ruslan Salakhutdinov
Louis-Philippe Morency
146
4
0
29 Apr 2020
EmbraceNet for Activity: A Deep Multimodal Fusion Architecture for Activity Recognition
Jun-Ho Choi
Jong-Seok Lee
72
22
0
29 Apr 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
348
36
0
28 Apr 2020
Deep Auto-Encoders with Sequential Learning for Multimodal Dimensional Emotion Recognition
IEEE transactions on multimedia (TMM), 2020
Dung Nguyen
D. Nguyen
Rui Zeng
Thanh Thi Nguyen
Son N. Tran
Thin Nguyen
Sridha Sridharan
Clinton Fookes
124
57
0
28 Apr 2020
Data-driven Flood Emulation: Speeding up Urban Flood Predictions by Deep Convolutional Neural Networks
Journal of Flood Risk Management (JFRM), 2020
Zifeng Guo
J. Leitão
N. Simões
V. Moosavi
AI4CE
106
146
0
17 Apr 2020
How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
George Sterpu
Christian Saam
N. Harte
198
33
0
17 Apr 2020
Sound of Guns: Digital Forensics of Gun Audio Samples meets Artificial Intelligence
Multimedia tools and applications (MTA), 2020
Simone Raponi
I. M. Ali
Gabriele Oligeri
129
32
0
15 Apr 2020
Composite Travel Generative Adversarial Networks for Tabular and Sequential Population Synthesis
Godwin Badu-Marfo
Bilal Farooq
Zachary Patterson
112
36
0
15 Apr 2020
Analysis of Social Media Data using Multimodal Deep Learning for Disaster Response
International Conference on Information Systems for Crisis Response and Management (ISCRAM), 2020
Ferda Ofli
Firoj Alam
Muhammad Imran
140
123
0
14 Apr 2020
Multimodal Categorization of Crisis Events in Social Media
Computer Vision and Pattern Recognition (CVPR), 2020
Mahdi Abavisani
Liwei Wu
Shengli Hu
Joel R. Tetreault
A. Jaimes
289
113
0
10 Apr 2020
Deep Multimodal Feature Encoding for Video Ordering
Vivek Sharma
Makarand Tapaswi
Rainer Stiefelhagen
173
11
0
05 Apr 2020
Multimodal Material Classification for Robots using Spectroscopy and High Resolution Texture Imaging
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
Zackory M. Erickson
Eliot Xing
Bharat Srirangam
Sonia Chernova
Charles C. Kemp
255
45
0
02 Apr 2020
Mapping individual differences in cortical architecture using multi-view representation learning
IEEE International Joint Conference on Neural Network (IJCNN), 2020
A. Sellami
Franccois-Xavier Dupé
Bastien Cagna
Hachem Kadri
Stéphane Ayache
Thierry Artières
S. Takerkart
142
10
0
01 Apr 2020
Shared Cross-Modal Trajectory Prediction for Autonomous Driving
Computer Vision and Pattern Recognition (CVPR), 2020
Chiho Choi
Joon Hee Choi
Srikanth Malla
Jiachen Li
489
73
0
01 Apr 2020
Knowledge as Priors: Cross-Modal Knowledge Generalization for Datasets without Superior Knowledge
Computer Vision and Pattern Recognition (CVPR), 2020
Long Zhao
Xi Peng
Yuxiao Chen
Mubbasir Kapadia
Dimitris N. Metaxas
261
68
0
01 Apr 2020
Fashion Meets Computer Vision: A Survey
ACM Computing Surveys (ACM CSUR), 2020
Wen-Huang Cheng
Sijie Song
Chieh-Yun Chen
S. Hidayati
Jiaying Liu
AI4TS
292
108
0
31 Mar 2020
Integrating Physiological Time Series and Clinical Notes with Deep Learning for Improved ICU Mortality Prediction
Satya Narayan Shukla
Benjamin M. Marlin
141
16
0
24 Mar 2020
Variational Inference for Deep Probabilistic Canonical Correlation Analysis
Mahdi Karami
Dale Schuurmans
BDL
149
4
0
09 Mar 2020
Adversarial Multimodal Representation Learning for Click-Through Rate Prediction
The Web Conference (WWW), 2020
Xiang Li
Chao Wang
Jiwei Tan
Xiaoyi Zeng
Dan Ou
Bo Zheng
129
59
0
07 Mar 2020
Deep Multi-Modal Sets
A. Reiter
Menglin Jia
Pu Yang
Ser-Nam Lim
BDL
222
4
0
03 Mar 2020
A Semi-supervised Graph Attentive Network for Financial Fraud Detection
Industrial Conference on Data Mining (IDM), 2019
Daixin Wang
J. Lin
Peng Cui
Quanhui Jia
Zhen Wang
Yanming Fang
Quan Yu
Jun Zhou
Shuang Yang
Yuan Qi
GNN
186
427
0
28 Feb 2020
RMP-SNN: Residual Membrane Potential Neuron for Enabling Deeper High-Accuracy and Low-Latency Spiking Neural Network
Computer Vision and Pattern Recognition (CVPR), 2020
Bing Han
G. Srinivasan
Kaushik Roy
271
376
0
25 Feb 2020
Real-time Fusion Network for RGB-D Semantic Segmentation Incorporating Unexpected Obstacle Detection for Road-driving Images
IEEE Robotics and Automation Letters (RA-L), 2020
Lei Sun
Kailun Yang
Xinxin Hu
Weijian Hu
Kaiwei Wang
SSeg
286
155
0
24 Feb 2020
AutoFoley: Artificial Synthesis of Synchronized Sound Tracks for Silent Videos with Deep Learning
IEEE transactions on multimedia (TMM), 2020
Sanchita Ghose
John J. Prevost
VGen
180
50
0
21 Feb 2020
Neural Attentive Multiview Machines
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Oren Barkan
Ori Katz
Noam Koenigstein
HAI
127
22
0
18 Feb 2020
Learning Robust Representations via Multi-View Information Bottleneck
International Conference on Learning Representations (ICLR), 2020
Marco Federici
Anjan Dutta
Patrick Forré
Nate Kushman
Zeynep Akata
SLR
255
309
0
17 Feb 2020
Hi-Net: Hybrid-fusion Network for Multi-modal MR Image Synthesis
IEEE Transactions on Medical Imaging (TMI), 2020
Tao Zhou
Huazhu Fu
Geng Chen
Jianbing Shen
Ling Shao
MedIm
370
325
0
11 Feb 2020
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
630
232
0
23 Jan 2020
Multimodal Deep Unfolding for Guided Image Super-Resolution
IEEE Transactions on Image Processing (TIP), 2020
Iman Marivani
Evaggelia Tsiligianni
Bruno Cornelis
Nikos Deligiannis
SupR
204
50
0
21 Jan 2020
A multimodal deep learning approach for named entity recognition from social media
M. Asgari-Chenaghlu
M. Feizi-Derakhshi
Leili Farzinvash
M. Balafar
C. Motamed
282
36
0
19 Jan 2020
Deep Audio-Visual Learning: A Survey
International Journal of Automation and Computing (IJAC), 2020
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
223
178
0
14 Jan 2020
Improved Robust ASR for Social Robots in Public Spaces
Charles Jankowski
Vishwas Mruthyunjaya
Ruixi Lin
VLM
87
3
0
14 Jan 2020
Multiview Representation Learning for a Union of Subspaces
Nils Holzenberger
R. Arora
89
1
0
30 Dec 2019
Learning from Learning Machines: Optimisation, Rules, and Social Norms
Travis LaCroix
Yoshua Bengio
85
7
0
29 Dec 2019
Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis
IEEE Transactions on Medical Imaging (TMI), 2019
Richard J. Chen
Ming Y. Lu
Jingwen Wang
Drew F. K. Williamson
S. Rodig
N. Lindeman
Faisal Mahmood
377
545
0
18 Dec 2019
Multimodal Self-Supervised Learning for Medical Image Analysis
Information Processing in Medical Imaging (IPMI), 2019
Aiham Taleb
Christoph Lippert
T. Klein
Moin Nabi
SSL
353
122
0
11 Dec 2019
Multimodal Generative Models for Compositional Representation Learning
Mike Wu
Noah D. Goodman
GAN
DRL
210
20
0
11 Dec 2019
Self-Supervised Learning of Video-Induced Visual Invariances
Computer Vision and Pattern Recognition (CVPR), 2019
Michael Tschannen
Josip Djolonga
Marvin Ritter
Aravindh Mahendran
Xiaohua Zhai
N. Houlsby
Sylvain Gelly
Mario Lucic
SSL
370
65
0
05 Dec 2019
See and Read: Detecting Depression Symptoms in Higher Education Students Using Multimodal Social Media Data
International Conference on Web and Social Media (ICWSM), 2019
Paulo Mann
A. Paes
Elton H. Matsushima
208
44
0
03 Dec 2019
Dividing and Conquering Cross-Modal Recipe Retrieval: from Nearest Neighbours Baselines to SoTA
Mikhail Fain
Niall Twomey
Andrey Ponikar
Ryan Fox
Danushka Bollegala
242
20
0
28 Nov 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Neural Information Processing Systems (NeurIPS), 2019
Humam Alwassel
D. Mahajan
Bruno Korbar
Lorenzo Torresani
Guohao Li
Du Tran
SSL
503
461
0
28 Nov 2019
MMTM: Multimodal Transfer Module for CNN Fusion
Computer Vision and Pattern Recognition (CVPR), 2019
Hamid Reza Vaezi Joze
Amirreza Shaban
Michael L. Iuzzolino
K. Koishida
407
350
0
20 Nov 2019
Modal-aware Features for Multimodal Hashing
Haien Zeng
Hanjiang Lai
Hanlu Chu
Yong Tang
Jian Yin
160
0
0
19 Nov 2019
VLUC: An Empirical Benchmark for Video-Like Urban Computing on Citywide Crowd and Traffic Prediction
Renhe Jiang
Zekun Cai
Zhaonan Wang
Chuang Yang
Z. Fan
Xuan Song
Kota Tsubouchi
Ryosuke Shibasaki
AI4TS
111
11
0
16 Nov 2019
Towards Pose-invariant Lip-Reading
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Shiyang Cheng
Pingchuan Ma
Georgios Tzimiropoulos
Stavros Petridis
Adrian Bulat
Jie Shen
Maja Pantic
270
32
0
14 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2019
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
325
408
0
10 Nov 2019
Adaptive Fusion Techniques for Multimodal Data
Gaurav Sahu
Olga Vechtomova
140
16
0
10 Nov 2019
Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative Models
Neural Information Processing Systems (NeurIPS), 2019
Yuge Shi
Siddharth Narayanaswamy
Brooks Paige
Juil Sock
DRL
255
324
0
08 Nov 2019
Towards a General Model of Knowledge for Facial Analysis by Multi-Source Transfer Learning
Valentin Vielzeuf
Alexis Lechervy
S. Pateux
F. Jurie
CVBM
164
5
0
08 Nov 2019
Previous
1
2
3
...
8
9
10
...
15
16
17
Next
Page 9 of 17
Page
of 17
Go