Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011

12 January 2023

Christopher Marquardt

Papers citing "Multimodal Deep Learning"

50 / 844 papers shown

Learning Factorized Multimodal Representations

Yifan Hao

Paul Pu Liang

Amir Zadeh

Louis-Philippe Morency

Ruslan Salakhutdinov

DRL

276

500

16 Jun 2018

On Machine Learning and Structure for Mobile Robots

Markus Wulfmeier

130

15 Jun 2018

Deep Learning for Classification Tasks on Geospatial Vector Polygons

R. V. Veer

Peter Bloem

E. Folmer

196

11 Jun 2018

Learn to Combine Modalities in Multimodal Deep Learning

210

157

29 May 2018

More Than a Feeling: Learning to Grasp and Regrasp using Vision and Touch

331

373

28 May 2018

Unsupervised Learning for Trustworthy IoT

25 May 2018

Omega: An Architecture for AI Unification

Eray Özkural

AI4CE

134

16 May 2018

On Learning Associations of Faces and Voices

Changil Kim

Hijung Valentina Shin

220

15 May 2018

Learnable PINs: Cross-Modal Embeddings for Person Identity

202

162

02 May 2018

Investigations on End-to-End Audiovisual Fusion

Michael Wand

Ngoc Thang Vu

J. Schmidhuber

30 Apr 2018

A Bimodal Learning Approach to Assist Multi-sensory Effects Synchronization

R. Abreu

J. Santos

Eduardo Bezerra

28 Apr 2018

Multi Layered-Parallel Graph Convolutional Network (ML-PGCN) for Disease Prediction

Nassir Navab

116

28 Apr 2018

Multi-Modal Coreference Resolution with the Correlation between Space Structures

21 Apr 2018

Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events

180

19 Apr 2018

Multi-view Hybrid Embedding: A Divide-and-Conquer ApproachIEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2018

169

19 Apr 2018

Deep Multimodal Subspace Clustering Networks

Mahdi Abavisani

Vishal M. Patel

304

180

17 Apr 2018

Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning

Xinze Wang

Yuan-fang Wang

William Yang Wang

139

15 Apr 2018

Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

Andrew Owens

Alexei A. Efros

SSL

590

796

10 Apr 2018

The Sound of Pixels

Hang Zhao

Chuang Gan

Andrew Rouditchenko

Carl Vondrick

Josh H. McDermott

Antonio Torralba

VLM

415

575

09 Apr 2018

Mix and match networks: encoder-decoder alignment for zero-pair image translation

Yaxing Wang

Joost van de Weijer

Luis Herranz

106

06 Apr 2018

Unsupervised Correlation Analysis

Yedid Hoshen

Lior Wolf

117

01 Apr 2018

Cross-modal Deep Variational Hand Pose Estimation

Otmar Hilliges

198

304

30 Mar 2018

Audio-Visual Event Localization in Unconstrained Videos

Yapeng Tian

Jing Shi

Bochen Li

Zhiyao Duan

Chenliang Xu

358

532

23 Mar 2018

Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings

Silvio Savarese

198

270

22 Mar 2018

Acoustic feature learning using cross-domain articulatory measurements

Qingming Tang

Weiran Wang

Karen Livescu

126

19 Mar 2018

A Survey on Deep Learning Toolkits and Libraries for Intelligent User Interfaces

181

13 Mar 2018

Multimodal Recurrent Neural Networks with Information Transfer Layers for Indoor Scene LabelingIEEE transactions on multimedia (TMM), 2018

Lap-Pui Chau

141

13 Mar 2018

Deep Learning in Mobile and Wireless Networking: A SurveyIEEE Communications Surveys and Tutorials (COMST), 2018

Chaoyun Zhang

P. Patras

Hamed Haddadi

357

1,423

12 Mar 2018

A Hybrid Method for Traffic Flow Forecasting Using Multimodal Deep Learning

Tianrui Li

151

173

06 Mar 2018

Cross-Paced Representation Learning with Partial Curricula for Sketch-based Image Retrieval

Dan Xu

Xavier Alameda-Pineda

Jingkuan Song

Elisa Ricci

Andrii Zadaianchuk

SSL

108

05 Mar 2018

Indic Handwritten Script Identification using Offline-Online Multimodal Deep Network

215

23 Feb 2018

ViTac: Feature Sharing between Vision and Tactile Sensing for Cloth Texture Recognition

Shan Luo

198

153

21 Feb 2018

End-to-end Audiovisual Speech Recognition

Georgios Tzimiropoulos

Maja Pantic

218

276

18 Feb 2018

Exact and Consistent Interpretation for Piecewise Linear Neural Networks: A Closed Form Solution

Juhua Hu

158

105

17 Feb 2018

Multimodal Generative Models for Scalable Weakly-Supervised Learning

Mike Wu

Noah D. Goodman

DRL

328

436

14 Feb 2018

Attention-Based Guided Structured Sparsity of Deep Neural Networks

192

13 Feb 2018

Learning to score the figure skating sports videos

240

151

08 Feb 2018

Efficient Large-Scale Multi-Modal Classification

D. Kiela

Edouard Grave

Armand Joulin

Tomas Mikolov

175

174

06 Feb 2018

Personalized Machine Learning for Robot Perception of Affect and Engagement in Autism Therapy

Bjorn Schuller

153

289

04 Feb 2018

Real-world Multi-object, Multi-grasp Detection

Fu-Jen Chu

Ruinian Xu

Patricio A. Vela

166

01 Feb 2018

Deep Multi-view Learning to Rank

273

31 Jan 2018

Improving Bi-directional Generation between Different Modalities with Variational Autoencoders

26 Jan 2018

PDNet: Semantic Segmentation integrated with a Primal-Dual Network for Document binarization

K. R. Ayyalasomayajula

F. Malmberg

Anders Brun

210

26 Jan 2018

Deep Canonically Correlated LSTMs

Neil Rohit Mallinar

Corbin Rosset

16 Jan 2018

Cross-modal Embeddings for Video and Audio Retrieval

137

07 Jan 2018

An Order Preserving Bilinear Model for Person Detection in Multi-Modal DataIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2017

144

20 Dec 2017

Learning Sight from Sound: Ambient Sound Provides Supervision for Visual LearningInternational Journal of Computer Vision (IJCV), 2017

Andrew Owens

Jiajun Wu

Josh H. McDermott

William T. Freeman

Antonio Torralba

SSL

266

170

20 Dec 2017

A Survey on Multi-View Clustering

Guoqing Chao

Shiliang Sun

J. Bi

196

290

18 Dec 2017

Adversarial Attribute-Image Person Re-identification

Hong-Xing Yu

201

05 Dec 2017

Multimodal Storytelling via Generative Adversarial Imitation Learning

Zhiqian Chen

Xuchao Zhang

Arnold P. Boedihardjo

Jing Dai

Chang-Tien Lu

155

05 Dec 2017