ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.04856
  4. Cited By
Multimodal Deep Learning

Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011
12 January 2023
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
Mingyu Kim
Christopher Marquardt
Marco Moldovan
Nadja Sauter
Juhan Nam
Rickmer Schulte
Karol Urbanczyk
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Yi Men
ArXiv (abs)PDFHTML

Papers citing "Multimodal Deep Learning"

50 / 844 papers shown
Multi-view Generative Adversarial Networks
Multi-view Generative Adversarial Networks
Mickaël Chen
Ludovic Denoyer
GAN
185
31
0
07 Nov 2016
Joint Multimodal Learning with Deep Generative Models
Joint Multimodal Learning with Deep Generative Models
Masahiro Suzuki
Kotaro Nakayama
Y. Matsuo
DRLGAN
269
244
0
07 Nov 2016
LipNet: End-to-End Sentence-level Lipreading
LipNet: End-to-End Sentence-level Lipreading
Yannis Assael
Brendan Shillingford
Shimon Whiteson
Nando de Freitas
334
449
0
05 Nov 2016
Ways of Conditioning Generative Adversarial Networks
Ways of Conditioning Generative Adversarial Networks
Hanock Kwak
Byoung-Tak Zhang
GAN
102
17
0
04 Nov 2016
CB2CF: A Neural Multiview Content-to-Collaborative Filtering Model for
  Completely Cold Item Recommendations
CB2CF: A Neural Multiview Content-to-Collaborative Filtering Model for Completely Cold Item Recommendations
Oren Barkan
Noam Koenigstein
E. Yogev
Ori Katz
VLM
330
4
0
01 Nov 2016
Deep fusion of visual signatures for client-server facial analysis
Deep fusion of visual signatures for client-server facial analysis
Binod Bhattarai
Gaurav Sharma
F. Jurie
290
3
0
01 Nov 2016
Cross-Modal Scene Networks
Cross-Modal Scene Networks
Y. Aytar
Lluis Castrejon
Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
SSL
180
117
0
27 Oct 2016
SoundNet: Learning Sound Representations from Unlabeled Video
SoundNet: Learning Sound Representations from Unlabeled Video
Y. Aytar
Carl Vondrick
Antonio Torralba
SSL
325
1,089
0
27 Oct 2016
Deep Variational Canonical Correlation Analysis
Deep Variational Canonical Correlation Analysis
Weiran Wang
Xinchen Yan
Honglak Lee
Karen Livescu
DRLBDL
276
151
0
11 Oct 2016
A Survey of Multi-View Representation Learning
A Survey of Multi-View Representation Learning
Yingming Li
Ming Yang
Zhongfei Zhang
AI4TS3DV
648
587
0
03 Oct 2016
Semantic Segmentation of Earth Observation Data Using Multimodal and
  Multi-scale Deep Networks
Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks
Nicolas Audebert
Bertrand Le Saux
Sébastien Lefèvre
SSeg
217
393
0
22 Sep 2016
GeThR-Net: A Generalized Temporally Hybrid Recurrent Neural Network for
  Multimodal Information Fusion
GeThR-Net: A Generalized Temporally Hybrid Recurrent Neural Network for Multimodal Information Fusion
Ankit Gandhi
Arjun Sharma
Arijit Biswas
Om Deshmukh
AI4TS
100
13
0
17 Sep 2016
Linking Image and Text with 2-Way Nets
Linking Image and Text with 2-Way NetsComputer Vision and Pattern Recognition (CVPR), 2016
Aviv Eisenschtat
Lior Wolf
301
182
0
29 Aug 2016
Learning Common and Specific Features for RGB-D Semantic Segmentation
  with Deconvolutional Networks
Learning Common and Specific Features for RGB-D Semantic Segmentation with Deconvolutional Networks
Jinghua Wang
Zhenhua Wang
Dacheng Tao
Simon See
G. Wang
MDE
165
159
0
03 Aug 2016
Learning Aligned Cross-Modal Representations from Weakly Aligned Data
Learning Aligned Cross-Modal Representations from Weakly Aligned Data
Lluis Castrejon
Y. Aytar
Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
SSLDRLAI4TS
171
177
0
25 Jul 2016
Deep Appearance Models: A Deep Boltzmann Machine Approach for Face
  Modeling
Deep Appearance Models: A Deep Boltzmann Machine Approach for Face Modeling
C. Duong
Khoa Luu
Kha Gia Quach
Tien D. Bui
CVBM
182
31
0
23 Jul 2016
A Comprehensive Survey on Cross-modal Retrieval
A Comprehensive Survey on Cross-modal Retrieval
Jen-tse Huang
Qiyue Yin
Wei Wang
Shu Wu
Liang Wang
189
320
0
21 Jul 2016
Coupled Generative Adversarial Networks
Coupled Generative Adversarial NetworksNeural Information Processing Systems (NeurIPS), 2016
Ming-Yuan Liu
Oncel Tuzel
OODGAN
560
1,699
0
24 Jun 2016
Picture It In Your Mind: Generating High Level Visual Representations
  From Textual Descriptions
Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions
F. Carrara
Andrea Esuli
T. Fagni
Fabrizio Falchi
Alejandro Moreo
DiffM
136
31
0
23 Jun 2016
Multi-Modal Hybrid Deep Neural Network for Speech Enhancement
Multi-Modal Hybrid Deep Neural Network for Speech Enhancement
Zhenzhou Wu
S. Sivadas
Yong Kiam Tan
Ma Bin
Rick Siow Mong Goh
108
17
0
15 Jun 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic
  Segmentation
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
768
2,275
0
07 Jun 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual GroundingConference on Empirical Methods in Natural Language Processing (EMNLP), 2016
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
629
1,548
0
06 Jun 2016
Multimodal Residual Learning for Visual QA
Multimodal Residual Learning for Visual QANeural Information Processing Systems (NeurIPS), 2016
Jin-Hwa Kim
Sang-Woo Lee
Donghyun Kwak
Min-Oh Heo
Jeonghee Kim
Jung-Woo Ha
Byoung-Tak Zhang
294
315
0
05 Jun 2016
Generative Adversarial Text to Image Synthesis
Generative Adversarial Text to Image Synthesis
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
GAN
503
3,341
0
17 May 2016
Learning Deep Representations of Fine-grained Visual Descriptions
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCLVLM
425
893
0
17 May 2016
Multimodal Sparse Coding for Event Detection
Multimodal Sparse Coding for Event Detection
Youngjune Gwon
William Campbell
K. Brady
D. Sturim
Miriam Cha
H. T. Kung
118
6
0
17 May 2016
Convolutional Neural Networks For Automatic State-Time Feature
  Extraction in Reinforcement Learning Applied to Residential Load Control
Convolutional Neural Networks For Automatic State-Time Feature Extraction in Reinforcement Learning Applied to Residential Load Control
Bert Claessens
Peter Vrancx
F. Ruelens
151
129
0
28 Apr 2016
Image Captioning with Deep Bidirectional LSTMs
Image Captioning with Deep Bidirectional LSTMs
Cheng Wang
Haojin Yang
Christian Bartz
Christoph Meinel
VLM
218
294
0
04 Apr 2016
Colorful Image Colorization
Colorful Image Colorization
Richard Y. Zhang
Phillip Isola
Alexei A. Efros
628
3,670
0
28 Mar 2016
Global-Local Face Upsampling Network
Global-Local Face Upsampling Network
Oncel Tuzel
Yuichi Taguchi
J. Hershey
CVBMSupR
183
37
0
23 Mar 2016
Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos
Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos
Amir Shahroudy
T. Ng
Yihong Gong
G. Wang
271
242
0
23 Mar 2016
Deep Learning in Bioinformatics
Deep Learning in Bioinformatics
Seonwoo Min
Byunghan Lee
Sungroh Yoon
AI4CE3DV
383
1,433
0
21 Mar 2016
Variational methods for Conditional Multimodal Deep Learning
Variational methods for Conditional Multimodal Deep Learning
Gaurav Pandey
Ambedkar Dukkipati
GANDRLCVBM
104
15
0
06 Mar 2016
Learning deep representation of multityped objects and tasks
Learning deep representation of multityped objects and tasks
T. Tran
Dinh Q. Phung
Svetha Venkatesh
AI4TS
80
1
0
04 Mar 2016
Measuring and Predicting Tag Importance for Image Retrieval
Measuring and Predicting Tag Importance for Image Retrieval
Shangwen Li
S. Purushotham
Chen Chen
Yuzhuo Ren
C.-C. Jay Kuo
306
32
0
28 Feb 2016
Multimodal Emotion Recognition Using Multimodal Deep Learning
Multimodal Emotion Recognition Using Multimodal Deep Learning
Wen Liu
Wei-Long Zheng
Bao-Liang Lu
118
62
0
26 Feb 2016
Unsupervised Domain Adaptation with Residual Transfer Networks
Unsupervised Domain Adaptation with Residual Transfer Networks
Mingsheng Long
Hanjing Zhu
Jianmin Wang
Sai Li
OOD
282
1,576
0
14 Feb 2016
Look, Listen and Learn - A Multimodal LSTM for Speaker Identification
Look, Listen and Learn - A Multimodal LSTM for Speaker Identification
Jimmy S. J. Ren
Yongtao Hu
Yu-Wing Tai
Chuan Wang
Kepeng Xu
Wenxiu Sun
Qiong Yan
167
111
0
13 Feb 2016
On Deep Multi-View Representation Learning: Objectives and Optimization
On Deep Multi-View Representation Learning: Objectives and Optimization
Weiran Wang
R. Arora
Karen Livescu
J. Bilmes
SSLDRL
338
1,003
0
02 Feb 2016
Matrix Neural Networks
Matrix Neural Networks
Junbin Gao
Yi Guo
Zhiyong Wang
150
28
0
15 Jan 2016
Robobarista: Learning to Manipulate Novel Objects via Deep Multimodal
  Embedding
Robobarista: Learning to Manipulate Novel Objects via Deep Multimodal Embedding
Jaeyong Sung
Seok Hyun Jin
Ian Lenz
Ashutosh Saxena
LM&Ro
202
16
0
12 Jan 2016
Brain4Cars: Car That Knows Before You Do via Sensory-Fusion Deep
  Learning Architecture
Brain4Cars: Car That Knows Before You Do via Sensory-Fusion Deep Learning Architecture
Ashesh Jain
H. Koppula
Shane Soh
Bharad Raghavan
Avi Singh
Ashutosh Saxena
193
137
0
05 Jan 2016
Common Variable Learning and Invariant Representation Learning using
  Siamese Neural Networks
Common Variable Learning and Invariant Representation Learning using Siamese Neural Networks
Uri Shaham
Roy R. Lederman
SSL
146
3
0
29 Dec 2015
Visually Indicated Sounds
Visually Indicated Sounds
Andrew Owens
Phillip Isola
Josh H. McDermott
Antonio Torralba
Edward H. Adelson
William T. Freeman
372
408
0
28 Dec 2015
A C++ library for Multimodal Deep Learning
A C++ library for Multimodal Deep Learning
Jian Jin
VLM
215
1
0
22 Dec 2015
Attribute2Image: Conditional Image Generation from Visual Attributes
Attribute2Image: Conditional Image Generation from Visual Attributes
Xinchen Yan
Jimei Yang
Kihyuk Sohn
Honglak Lee
DRLGAN
332
795
0
02 Dec 2015
Learning with Memory Embeddings
Learning with Memory Embeddings
Volker Tresp
Cristóbal Esteban
Yinchong Yang
S. Baier
Denis Krompass
545
32
0
25 Nov 2015
Patterns for Learning with Side Information
Patterns for Learning with Side Information
Rico Jonschkowski
Sebastian Hofer
Oliver Brock
SSL
309
30
0
19 Nov 2015
Multimodal sparse representation learning and applications
Multimodal sparse representation learning and applications
Miriam Cha
Youngjune Gwon
H. T. Kung
192
16
0
19 Nov 2015
Learning Deep Structure-Preserving Image-Text Embeddings
Learning Deep Structure-Preserving Image-Text Embeddings
Liwei Wang
Yin Li
Svetlana Lazebnik
483
822
0
19 Nov 2015
Previous
123...151617
Next
Page 16 of 17
Pageof 17