ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.04856
  4. Cited By
Multimodal Deep Learning

Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011
12 January 2023
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
Mingyu Kim
Christopher Marquardt
Marco Moldovan
Nadja Sauter
Juhan Nam
Rickmer Schulte
Karol Urbanczyk
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Yi Men
ArXiv (abs)PDFHTML

Papers citing "Multimodal Deep Learning"

50 / 841 papers shown
Title
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry
  and Semantics
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
Alex Kendall
Y. Gal
R. Cipolla
3DH
485
3,544
0
19 May 2017
Deep Multi-view Models for Glitch Classification
Deep Multi-view Models for Glitch Classification
S. Bahaadini
N. Rohani
S. Coughlin
M. Zevin
V. Kalogera
Aggelos K. Katsaggelos
87
46
0
28 Apr 2017
End-to-End Multimodal Emotion Recognition using Deep Neural Networks
End-to-End Multimodal Emotion Recognition using Deep Neural Networks
Panagiotis Tzirakis
George Trigeorgis
M. Nicolaou
Björn Schuller
Stefanos Zafeiriou
HAI
152
600
0
27 Apr 2017
Deep Cross-Modal Audio-Visual Generation
Deep Cross-Modal Audio-Visual Generation
Lele Chen
Sudhanshu Srivastava
Z. Duan
Chenliang Xu
188
228
0
26 Apr 2017
Tapping the sensorimotor trajectory
Tapping the sensorimotor trajectory
Oswald Berthold
Verena V. Hafner
96
0
0
25 Apr 2017
Semi-supervised Bayesian Deep Multi-modal Emotion Recognition
Semi-supervised Bayesian Deep Multi-modal Emotion Recognition
Changde Du
Changying Du
Jinpeng Li
Wei-Long Zheng
Bao-Liang Lu
Huiguang He
65
9
0
25 Apr 2017
Learning weakly supervised multimodal phoneme embeddings
Learning weakly supervised multimodal phoneme embeddings
Rahma Chaabouni
Ewan Dunbar
Neil Zeghidour
Emmanuel Dupoux
SSL
162
10
0
23 Apr 2017
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal
  Attentions
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions
Amir Mazaheri
Dong Zhang
M. Shah
91
12
0
15 Apr 2017
Cross-media Similarity Metric Learning with Unified Deep Networks
Cross-media Similarity Metric Learning with Unified Deep Networks
Jinwei Qi
Xin Huang
Yuxin Peng
SSL
62
9
0
14 Apr 2017
Learning Joint Multilingual Sentence Representations with Neural Machine
  Translation
Learning Joint Multilingual Sentence Representations with Neural Machine Translation
Holger Schwenk
Matthijs Douze
303
216
0
13 Apr 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Liwei Wang
Yin Li
Jing-ling Huang
Svetlana Lazebnik
VLM
205
530
0
11 Apr 2017
Deep Multimodal Representation Learning from Temporal Data
Deep Multimodal Representation Learning from Temporal Data
Xitong Yang
Palghat Ramesh
Radha Chitta
S. Madhvanath
Edgar A. Bernal
Jiebo Luo
AI4TS
138
102
0
11 Apr 2017
Fine-graind Image Classification via Combining Vision and Language
Fine-graind Image Classification via Combining Vision and Language
Xiangteng He
Yuxin Peng
VLM
203
174
0
10 Apr 2017
Coupled Deep Learning for Heterogeneous Face Recognition
Coupled Deep Learning for Heterogeneous Face Recognition
Xiang Wu
Lingxiao Song
Ran He
Tieniu Tan
CVBM
285
104
0
08 Apr 2017
AMC: Attention guided Multi-modal Correlation Learning for Image Search
AMC: Attention guided Multi-modal Correlation Learning for Image Search
Kan Chen
Trung Bui
Chen Fang
Zhaowen Wang
Ram Nevatia
120
38
0
03 Apr 2017
Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional
  Neural Networks
Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
Jen-Cheng Hou
Syu-Siang Wang
Ying-Hui Lai
Yu Tsao
Hsiu-Wen Chang
H. Wang
275
23
0
30 Mar 2017
Multimodal deep learning approach for joint EEG-EMG data compression and
  classification
Multimodal deep learning approach for joint EEG-EMG data compression and classification
Ahmed Ben Said
Amr M. Mohamed
Tarek M. Elfouly
Khaled A. Harras
Z. J. Wang
97
83
0
27 Mar 2017
Joint Intermodal and Intramodal Label Transfers for Extremely Rare or
  Unseen Classes
Joint Intermodal and Intramodal Label Transfers for Extremely Rare or Unseen Classes
Guo-Jun Qi
Wen Liu
Charu C. Aggarwal
Thomas Huang
VLM
150
55
0
22 Mar 2017
High-Resolution Breast Cancer Screening with Multi-View Deep
  Convolutional Neural Networks
High-Resolution Breast Cancer Screening with Multi-View Deep Convolutional Neural Networks
Krzysztof J. Geras
Stacey Wolfson
Yiqiu Shen
Nan Wu
S. G. Kim
Eric Kim
Laura Heacock
Ujas N Parikh
Linda Moy
Dong Wang
254
237
0
21 Mar 2017
Cross-modal Deep Metric Learning with Multi-task Regularization
Cross-modal Deep Metric Learning with Multi-task Regularization
Xin Huang
Yuxin Peng
62
18
0
21 Mar 2017
Learning Robust Visual-Semantic Embeddings
Learning Robust Visual-Semantic Embeddings
Yifan Hao
Liang-Kang Huang
Ruslan Salakhutdinov
SSLAI4TS
139
180
0
17 Mar 2017
Sensor Fusion for Robot Control through Deep Reinforcement Learning
Sensor Fusion for Robot Control through Deep Reinforcement Learning
Steven Bohez
Tim Verbelen
E. D. Coninck
B. Vankeirsbilck
Pieter Simoens
Bart Dhoedt
SSL
101
29
0
13 Mar 2017
Convolutional Spike Timing Dependent Plasticity based Feature Learning
  in Spiking Neural Networks
Convolutional Spike Timing Dependent Plasticity based Feature Learning in Spiking Neural Networks
Priyadarshini Panda
G. Srinivasan
Kaushik Roy
62
17
0
10 Mar 2017
RGB-D Salient Object Detection Based on Discriminative Cross-modal Transfer Learning
Hao Chen
Yangfu Li
Jane Polak Scowcroft
75
2
0
01 Mar 2017
Analyzing Modular CNN Architectures for Joint Depth Prediction and
  Semantic Segmentation
Analyzing Modular CNN Architectures for Joint Depth Prediction and Semantic SegmentationIEEE International Conference on Robotics and Automation (ICRA), 2017
O. Jafari
Oliver Groth
Alexander Kirillov
M. Yang
Carsten Rother
3DV
104
65
0
26 Feb 2017
Domain Adaptation for Visual Applications: A Comprehensive Survey
Domain Adaptation for Visual Applications: A Comprehensive Survey
G. Csurka
OOD
295
536
0
17 Feb 2017
Deep Heterogeneous Feature Fusion for Template-Based Face Recognition
Deep Heterogeneous Feature Fusion for Template-Based Face RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2017
Navaneeth Bodla
Jingxiao Zheng
Hongyu Xu
Jun-Cheng Chen
Carlos D. Castillo
Rama Chellappa
CVBM
234
66
0
15 Feb 2017
Gated Multimodal Units for Information Fusion
Gated Multimodal Units for Information FusionInternational Conference on Learning Representations (ICLR), 2017
John Arevalo
Thamar Solorio
Manuel Montes-y-Gómez
Fabio Gonzalez
501
452
0
07 Feb 2017
PCA-Initialized Deep Neural Networks Applied To Document Image Analysis
PCA-Initialized Deep Neural Networks Applied To Document Image AnalysisIEEE International Conference on Document Analysis and Recognition (ICDAR), 2017
Mathias Seuret
Michele Alberti
Rolf Ingold
Marcus Liwicki
ODL
96
56
0
01 Feb 2017
End-To-End Visual Speech Recognition With LSTMs
End-To-End Visual Speech Recognition With LSTMsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2017
Stavros Petridis
Zuwei Li
Maja Pantic
VLM
114
119
0
20 Jan 2017
Fusion of Heterogeneous Data in Convolutional Networks for Urban
  Semantic Labeling (Invited Paper)
Fusion of Heterogeneous Data in Convolutional Networks for Urban Semantic Labeling (Invited Paper)Joint Urban Remote Sensing Event (JURSE), 2017
Nicolas Audebert
Bertrand Le Saux
Sébastien Lefèvre
67
35
0
20 Jan 2017
Image Generation and Editing with Variational Info Generative
  AdversarialNetworks
Image Generation and Editing with Variational Info Generative AdversarialNetworks
Mahesh Gorijala
Ambedkar Dukkipati
GAN
91
15
0
17 Jan 2017
Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and
  Lipreading
Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading
Chunlin Tian
Weijun Ji
47
7
0
16 Jan 2017
Multi-task Learning Of Deep Neural Networks For Audio Visual Automatic
  Speech Recognition
Multi-task Learning Of Deep Neural Networks For Audio Visual Automatic Speech Recognition
Abhinav Thanda
S. Venkatesan
CVBM
92
29
0
10 Jan 2017
Deep learning for plasma tomography using the bolometer system at JET
Deep learning for plasma tomography using the bolometer system at JET
F. Matos
D. R. Ferreira
P. Carvalho
JET Contributors
103
47
0
02 Jan 2017
A Deep Learning Approach To Multiple Kernel Fusion
A Deep Learning Approach To Multiple Kernel FusionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2016
Huan Song
Jayaraman J. Thiagarajan
P. Sattigeri
Karthikeyan N. Ramamurthy
A. Spanias
168
18
0
28 Dec 2016
DeMIAN: Deep Modality Invariant Adversarial Network
DeMIAN: Deep Modality Invariant Adversarial Network
Kuniaki Saito
Yusuke Mukuta
Yoshitaka Ushiku
Tatsuya Harada
VLMGAN
127
5
0
23 Dec 2016
Neural networks based EEG-Speech Models
Neural networks based EEG-Speech Models
Pengfei Sun
Jun Qin
75
27
0
16 Dec 2016
Deep Learning of Robotic Tasks without a Simulator using Strong and Weak
  Human Supervision
Deep Learning of Robotic Tasks without a Simulator using Strong and Weak Human Supervision
Bar Hilleli
Ran El-Yaniv
91
2
0
04 Dec 2016
Joint Visual Denoising and Classification using Deep Learning
Joint Visual Denoising and Classification using Deep Learning
Gang Chen
Yawei Li
S. Srihari
78
10
0
04 Dec 2016
Is a picture worth a thousand words? A Deep Multi-Modal Fusion
  Architecture for Product Classification in e-commerce
Is a picture worth a thousand words? A Deep Multi-Modal Fusion Architecture for Product Classification in e-commerce
Tom Zahavy
Alessandro Magnani
Abhinandan Krishnan
Shie Mannor
131
67
0
29 Nov 2016
Training an Interactive Humanoid Robot Using Multimodal Deep
  Reinforcement Learning
Training an Interactive Humanoid Robot Using Multimodal Deep Reinforcement Learning
Heriberto Cuayáhuitl
G. Couly
Clément Olalainty
103
3
0
26 Nov 2016
Robust end-to-end deep audiovisual speech recognition
Robust end-to-end deep audiovisual speech recognition
Ramon Sanabria
Florian Metze
Fernando de la Torre
121
7
0
21 Nov 2016
Joint Network based Attention for Action Recognition
Joint Network based Attention for Action Recognition
Yemin Shi
Yonghong Tian
Yaowei Wang
Tiejun Huang
100
8
0
16 Nov 2016
Multi-view Recurrent Neural Acoustic Word Embeddings
Multi-view Recurrent Neural Acoustic Word Embeddings
Wanjia He
Weiran Wang
Karen Livescu
178
88
0
14 Nov 2016
Audio Visual Speech Recognition using Deep Recurrent Neural Networks
Audio Visual Speech Recognition using Deep Recurrent Neural Networks
Abhinav Thanda
S. Venkatesan
96
25
0
09 Nov 2016
Multispectral Deep Neural Networks for Pedestrian Detection
Multispectral Deep Neural Networks for Pedestrian Detection
Jingjing Liu
Shaoting Zhang
Shu Wang
Dimitris N. Metaxas
3DH
120
403
0
08 Nov 2016
Multi-view Generative Adversarial Networks
Multi-view Generative Adversarial Networks
Mickaël Chen
Ludovic Denoyer
GAN
136
31
0
07 Nov 2016
Joint Multimodal Learning with Deep Generative Models
Joint Multimodal Learning with Deep Generative Models
Masahiro Suzuki
Kotaro Nakayama
Y. Matsuo
DRLGAN
194
238
0
07 Nov 2016
LipNet: End-to-End Sentence-level Lipreading
LipNet: End-to-End Sentence-level Lipreading
Yannis Assael
Brendan Shillingford
Shimon Whiteson
Nando de Freitas
233
439
0
05 Nov 2016
Previous
123...14151617
Next