Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011

12 January 2023

Christopher Marquardt

Papers citing "Multimodal Deep Learning"

50 / 844 papers shown

Deep Collective Matrix Factorization for Augmented Multi-View Learning

Ragunathan Mariappan

Vaibhav Rajan

150

28 Nov 2018

Uncertainty aware audiovisual activity recognition using deep Bayesian variational inference

182

27 Nov 2018

Cross-domain Deep Feature Combination for Bird Species Classification with Audio-visual Data

B. Naranchimeg

Chao Zhang

T. Akashi

26 Nov 2018

A Novel Technique for Evidence based Conditional Inference in Deep Neural Networks via Latent Feature Perturbation

206

24 Nov 2018

Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal BehaviorsAAAI Conference on Artificial Intelligence (AAAI), 2018

Yansen Wang

Louis-Philippe Morency

242

455

23 Nov 2018

Learning from Multiview Correlations in Open-Domain VideosIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018

Pranava Madhyastha

134

21 Nov 2018

Visual-Texual Emotion Analysis with Deep Coupled Video and Danmu Neural NetworksIEEE transactions on multimedia (TMM), 2018

116

19 Nov 2018

159

18 Nov 2018

Semi-supervised Deep Representation Learning for Multi-View Problems

Philip S. Yu

149

11 Nov 2018

Multi-Source Neural Variational Inference

Richard Kurle

Stephan Günnemann

Patrick van der Smagt

BDL SSL DRL

179

11 Nov 2018

Cross and Learn: Cross-Modal Self-SupervisionGerman Conference on Pattern Recognition (DAGM), 2018

250

09 Nov 2018

Multimodal One-Shot Learning of Speech and ImagesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018

161

09 Nov 2018

Y^2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by Joint Reconstruction and Prediction of View and Word SequencesAAAI Conference on Artificial Intelligence (AAAI), 2018

109

07 Nov 2018

Cogni-Net: Cognitive Feature Learning through Deep Visual Perception

184

01 Nov 2018

Software Engineering Challenges of Deep Learning

227

189

29 Oct 2018

Vehicle Tracking Using Surveillance with Multimodal Data Fusion

29 Oct 2018

Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features

320

150

25 Oct 2018

Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks

Silvio Savarese

Li Fei-Fei

Animesh Garg

Jeannette Bohg

SSL

257

406

24 Oct 2018

Dense Multimodal Fusion for Hierarchically Joint Representation

Di Hu

Feiping Nie

Xuelong Li

183

08 Oct 2018

Image and Encoded Text Fusion for Multi-Modal Classification

I. Gallo

Alessandro Calefati

Shah Nawaz

Muhammad Kamran Janjua

03 Oct 2018

Pixel and Feature Level Based Domain Adaption for Object Detection in Autonomous Driving

Yuhu Shan

W. Lu

C. Chew

172

30 Sep 2018

Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture

Stavros Petridis

Themos Stafylakis

Pingchuan Ma

Georgios Tzimiropoulos

Maja Pantic

175

151

28 Sep 2018

Vector Learning for Cross Domain RepresentationsInternational Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2017

27 Sep 2018

Machine Learning for Forecasting Mid Price Movement using Limit Order Book Data

Nikolaos Passalis

Anastasios Tefas

201

19 Sep 2018

Incomplete Multi-view Clustering via Graph Regularized Matrix Factorization

109

17 Sep 2018

End-to-end Audiovisual Speech Activity Detection with Bimodal Recurrent Neural Models

Fei Tao

John H. L. Hansen

152

12 Sep 2018

Implicit Analysis of Perceptual Multimedia Experience Based on Physiological Response: A Review

Seong-Eun Moon

Jong-Seok Lee

12 Sep 2018

Using Sparse Semantic Embeddings Learned from Multimodal Text and Image Data to Model Human Conceptual Knowledge

226

07 Sep 2018

Multi-view Factorization AutoEncoder with Network Constraints for Multi-omic Integrative Analysis

Tianle Ma

A. Zhang

106

06 Sep 2018

Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition

George Sterpu

Christian Saam

N. Harte

294

05 Sep 2018

Multi-Adversarial Domain Adaptation

191

927

04 Sep 2018

Role of Intonation in Scoring Spoken English

Amber Nigam

Ishan Sodhi

Tuhinanksu Das

23 Aug 2018

LRMM: Learning to Recommend with Missing Modalities

Cheng Wang

Mathias Niepert

Hui Li

138

21 Aug 2018

Dynamic Temporal Alignment of Speech to Lips

Tavi Halperin

Ariel Ephrat

Shmuel Peleg

124

19 Aug 2018

Multimodal Deep Neural Networks using Both Engineered and Learned Representations for Biodegradability Prediction

149

13 Aug 2018

Multimodal Language Analysis with Recurrent Multistage Fusion

Paul Pu Liang

Liu Ziyin

Amir Zadeh

Louis-Philippe Morency

220

216

12 Aug 2018

Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional Data

179

27 Jul 2018

Visual Affordance and Function Understanding: A SurveyACM Computing Surveys (CSUR), 2018

Mohammed Hassanin

Salman Khan

M. Tahtali

175

18 Jul 2018

Robust Deep Multi-modal Learning Based on Gated Information Fusion NetworkAsian Conference on Computer Vision (ACCV), 2018

221

17 Jul 2018

A Multimodal Approach to Predict Social Media PopularityConference on Multimedia Information Processing and Retrieval (MIPR), 2018

137

16 Jul 2018

Object Detection with Deep Learning: A Review

584

4,478

15 Jul 2018

3D Hand Pose Estimation using Simulation and Partial-Supervision with a Shared Latent Space

14 Jul 2018

Large-Scale Visual Speech Recognition

...

272

165

13 Jul 2018

Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis

169

11 Jul 2018

Fuzzy Logic Interpretation of Quadratic Networks

Fenglei Fan

Ge Wang

213

04 Jul 2018

Harnessing AI for Speech Reconstruction using Multi-view Silent Video FeedACM Multimedia (ACM MM), 2018

Shiníchi Satoh

126

02 Jul 2018

Learning Visually-Grounded Semantics from Contrastive Adversarial SamplesInternational Conference on Computational Linguistics (COLING), 2018

Jiayuan Mao

177

27 Jun 2018

Disentangled VAE Representations for Multi-Aspect and Missing Data

164

24 Jun 2018

Learning Multimodal Representations for Unseen Activities

A. Piergiovanni

Michael S. Ryoo

SSL

218

21 Jun 2018

Multimodal Grounding for Language Processing

Lisa Beinborn

Teresa Botschen

Iryna Gurevych

159

17 Jun 2018