Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011

12 January 2023

Christopher Marquardt

Papers citing "Multimodal Deep Learning"

50 / 844 papers shown

Multi-view Generative Adversarial Networks

Mickaël Chen

Ludovic Denoyer

GAN

185

07 Nov 2016

Joint Multimodal Learning with Deep Generative Models

269

244

07 Nov 2016

LipNet: End-to-End Sentence-level Lipreading

334

449

05 Nov 2016

Ways of Conditioning Generative Adversarial Networks

Hanock Kwak

Byoung-Tak Zhang

GAN

102

04 Nov 2016

CB2CF: A Neural Multiview Content-to-Collaborative Filtering Model for Completely Cold Item Recommendations

330

01 Nov 2016

Deep fusion of visual signatures for client-server facial analysis

Binod Bhattarai

Gaurav Sharma

F. Jurie

290

01 Nov 2016

Cross-Modal Scene Networks

Carl Vondrick

Antonio Torralba

180

117

27 Oct 2016

SoundNet: Learning Sound Representations from Unlabeled Video

Y. Aytar

Carl Vondrick

Antonio Torralba

SSL

325

1,089

27 Oct 2016

Deep Variational Canonical Correlation Analysis

276

151

11 Oct 2016

A Survey of Multi-View Representation Learning

648

587

03 Oct 2016

Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks

Nicolas Audebert

Bertrand Le Saux

Sébastien Lefèvre

SSeg

217

393

22 Sep 2016

GeThR-Net: A Generalized Temporally Hybrid Recurrent Neural Network for Multimodal Information Fusion

100

17 Sep 2016

Linking Image and Text with 2-Way Nets

Computer Vision and Pattern Recognition (CVPR), 2016

Aviv Eisenschtat

Lior Wolf

301

182

29 Aug 2016

Learning Common and Specific Features for RGB-D Semantic Segmentation with Deconvolutional Networks

165

159

03 Aug 2016

Learning Aligned Cross-Modal Representations from Weakly Aligned Data

Carl Vondrick

Antonio Torralba

171

177

25 Jul 2016

Deep Appearance Models: A Deep Boltzmann Machine Approach for Face Modeling

182

23 Jul 2016

A Comprehensive Survey on Cross-modal Retrieval

Liang Wang

189

320

21 Jul 2016

Coupled Generative Adversarial Networks

Neural Information Processing Systems (NeurIPS), 2016

Ming-Yuan Liu

Oncel Tuzel

OOD GAN

560

1,699

24 Jun 2016

Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions

136

23 Jun 2016

Multi-Modal Hybrid Deep Neural Network for Speech Enhancement

108

15 Jun 2016

ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation

768

2,275

07 Jun 2016

Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2016

629

1,548

06 Jun 2016

Multimodal Residual Learning for Visual QA

Neural Information Processing Systems (NeurIPS), 2016

294

315

05 Jun 2016

Generative Adversarial Text to Image Synthesis

Bernt Schiele

503

3,341

17 May 2016

Learning Deep Representations of Fine-grained Visual Descriptions

Bernt Schiele

425

893

17 May 2016

Multimodal Sparse Coding for Event Detection

118

17 May 2016

Convolutional Neural Networks For Automatic State-Time Feature Extraction in Reinforcement Learning Applied to Residential Load Control

Bert Claessens

Peter Vrancx

F. Ruelens

151

129

28 Apr 2016

Image Captioning with Deep Bidirectional LSTMs

Cheng Wang

218

294

04 Apr 2016

Colorful Image Colorization

Richard Y. Zhang

Phillip Isola

Alexei A. Efros

628

3,670

28 Mar 2016

Global-Local Face Upsampling Network

183

23 Mar 2016

Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos

271

242

23 Mar 2016

Deep Learning in Bioinformatics

383

1,433

21 Mar 2016

Variational methods for Conditional Multimodal Deep Learning

Gaurav Pandey

Ambedkar Dukkipati

GAN DRL CVBM

104

06 Mar 2016

Learning deep representation of multityped objects and tasks

04 Mar 2016

Measuring and Predicting Tag Importance for Image Retrieval

306

28 Feb 2016

Multimodal Emotion Recognition Using Multimodal Deep Learning

Wen Liu

Wei-Long Zheng

Bao-Liang Lu

118

26 Feb 2016

Unsupervised Domain Adaptation with Residual Transfer Networks

282

1,576

14 Feb 2016

Look, Listen and Learn - A Multimodal LSTM for Speaker Identification

167

111

13 Feb 2016

On Deep Multi-View Representation Learning: Objectives and Optimization

338

1,003

02 Feb 2016

Matrix Neural Networks

Junbin Gao

Yi Guo

Zhiyong Wang

150

15 Jan 2016

Robobarista: Learning to Manipulate Novel Objects via Deep Multimodal Embedding

202

12 Jan 2016

Brain4Cars: Car That Knows Before You Do via Sensory-Fusion Deep Learning Architecture

193

137

05 Jan 2016

Common Variable Learning and Invariant Representation Learning using Siamese Neural Networks

Uri Shaham

Roy R. Lederman

SSL

146

29 Dec 2015

Visually Indicated Sounds

Antonio Torralba

William T. Freeman

372

408

28 Dec 2015

A C++ library for Multimodal Deep Learning

Jian Jin

VLM

215

22 Dec 2015

Attribute2Image: Conditional Image Generation from Visual Attributes

332

795

02 Dec 2015

Learning with Memory Embeddings

545

25 Nov 2015

Patterns for Learning with Side Information

309

19 Nov 2015

Multimodal sparse representation learning and applications

Miriam Cha

Youngjune Gwon

H. T. Kung

192

19 Nov 2015

Learning Deep Structure-Preserving Image-Text Embeddings

Liwei Wang

Yin Li

Svetlana Lazebnik

483

822

19 Nov 2015