ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.04856
  4. Cited By
Multimodal Deep Learning

Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011
12 January 2023
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
Mingyu Kim
Christopher Marquardt
Marco Moldovan
Nadja Sauter
Juhan Nam
Rickmer Schulte
Karol Urbanczyk
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Yi Men
ArXiv (abs)PDFHTML

Papers citing "Multimodal Deep Learning"

50 / 844 papers shown
Doppler Spectrum Classification with CNNs via Heatmap Location Encoding
  and a Multi-head Output Layer
Doppler Spectrum Classification with CNNs via Heatmap Location Encoding and a Multi-head Output Layer
A. Gilbert
M. Holden
L. Eikvil
Mariia Rakhmail
Aleksandar Babić
S. Aase
E. Samset
K. Mcleod
111
2
0
06 Nov 2019
Attributed Sequence Embedding
Attributed Sequence Embedding
Zhongfang Zhuang
Xiangnan Kong
Elke A. Rundensteiner
Jihane Zouaoui
Aditya Arora
302
13
0
03 Nov 2019
Co-Generation with GANs using AIS based HMC
Co-Generation with GANs using AIS based HMCNeural Information Processing Systems (NeurIPS), 2019
Tiantian Fang
Alex Schwing
140
2
0
31 Oct 2019
CTNN: Corticothalamic-inspired neural network
CTNN: Corticothalamic-inspired neural network
Leendert A. Remmelzwaal
A. Mishra
George F. R. Ellis
OOD
183
1
0
28 Oct 2019
Twins Recognition Using Hierarchical Score Level Fusion
Twins Recognition Using Hierarchical Score Level Fusion
Cihan Akin
Umit Kacar
M. Kirci
89
2
0
27 Oct 2019
Autoencoding with a Classifier System
Autoencoding with a Classifier SystemIEEE Transactions on Evolutionary Computation (TEVC), 2019
R. Preen
Stewart W. Wilson
Larry Bull
AI4CE
308
14
0
23 Oct 2019
A Survey and Taxonomy of Adversarial Neural Networks for Text-to-Image
  Synthesis
A Survey and Taxonomy of Adversarial Neural Networks for Text-to-Image Synthesis
Jorge Agnese
Jonathan Herrera
Haicheng Tao
Xingquan Zhu
EGVM
169
114
0
21 Oct 2019
Multi-modal Deep Analysis for Multimedia
Multi-modal Deep Analysis for Multimedia
Wenwu Zhu
Xin Eric Wang
Hongzhi Li
219
49
0
11 Oct 2019
Multimodal representation models for prediction and control from partial
  information
Multimodal representation models for prediction and control from partial information
Martina Zambelli
Antoine Cully
Y. Demiris
133
30
0
09 Oct 2019
Cross-Modal Subspace Learning with Scheduled Adaptive Margin Constraints
Cross-Modal Subspace Learning with Scheduled Adaptive Margin ConstraintsACM Multimedia (ACM MM), 2019
David Semedo
João Magalhães
137
11
0
30 Sep 2019
CNN-based RGB-D Salient Object Detection: Learn, Select and Fuse
CNN-based RGB-D Salient Object Detection: Learn, Select and Fuse
Hao Chen
Youfu Li
ObjD
118
24
0
20 Sep 2019
HyperLearn: A Distributed Approach for Representation Learning in
  Datasets With Many Modalities
HyperLearn: A Distributed Approach for Representation Learning in Datasets With Many ModalitiesACM Multimedia (ACM MM), 2019
Devanshu Arya
Stevan Rudinac
Marcel Worring
147
11
0
19 Sep 2019
Multimodal Multitask Representation Learning for Pathology Biobank
  Metadata Prediction
Multimodal Multitask Representation Learning for Pathology Biobank Metadata Prediction
W. Weng
Yuannan Cai
Angela Lin
Fraser Tan
Po-Hsuan Cameron Chen
135
22
0
17 Sep 2019
Learning Visuomotor Policies for Aerial Navigation Using Cross-Modal
  Representations
Learning Visuomotor Policies for Aerial Navigation Using Cross-Modal RepresentationsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2019
Rogerio Bonatti
Ratnesh Madaan
Vibhav Vineet
Sebastian Scherer
Ashish Kapoor
SSL
203
45
0
16 Sep 2019
Joint Wasserstein Autoencoders for Aligning Multimodal Embeddings
Joint Wasserstein Autoencoders for Aligning Multimodal Embeddings
Shweta Mahajan
Teresa Botschen
Iryna Gurevych
Stefan Roth
97
8
0
14 Sep 2019
Co-Attentive Cross-Modal Deep Learning for Medical Evidence Synthesis
  and Decision Making
Co-Attentive Cross-Modal Deep Learning for Medical Evidence Synthesis and Decision Making
Devin Taylor
Simeon E. Spasov
Pietro Lio
120
6
0
13 Sep 2019
Recognizing Object Affordances to Support Scene Reasoning for
  Manipulation Tasks
Recognizing Object Affordances to Support Scene Reasoning for Manipulation Tasks
Fu-Jen Chu
Ruinian Xu
Chao Tang
Patricio A. Vela
126
7
0
12 Sep 2019
Using Clinical Notes with Time Series Data for ICU Management
Using Clinical Notes with Time Series Data for ICU ManagementConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Swaraj Khadanga
Karan Aggarwal
Shafiq Joty
Jaideep Srivastava
141
71
0
12 Sep 2019
MIDI-Sandwich2: RNN-based Hierarchical Multi-modal Fusion Generation VAE
  networks for multi-track symbolic music generation
MIDI-Sandwich2: RNN-based Hierarchical Multi-modal Fusion Generation VAE networks for multi-track symbolic music generation
X. Liang
Junmin Wu
Jing Cao
MGen
776
20
0
08 Sep 2019
Learning Alignment for Multimodal Emotion Recognition from Speech
Learning Alignment for Multimodal Emotion Recognition from SpeechInterspeech (Interspeech), 2019
Haiyang Xu
Hui Zhang
Kun Han
Yun Wang
Yiping Peng
Xiangang Li
157
145
0
06 Sep 2019
Denoising Auto-encoding Priors in Undecimated Wavelet Domain for MR
  Image Reconstruction
Denoising Auto-encoding Priors in Undecimated Wavelet Domain for MR Image Reconstruction
Siyuan Wang
Junjie Lv
Yuanyuan Hu
Dong Liang
Minghui Zhang
Qiegen Liu
MedIm
190
15
0
03 Sep 2019
Online Sensor Hallucination via Knowledge Distillation for Multimodal
  Image Classification
Online Sensor Hallucination via Knowledge Distillation for Multimodal Image Classification
S. Kumar
Biplab Banerjee
S. Chaudhuri
161
5
0
28 Aug 2019
A Data-Efficient Deep Learning Approach for Deployable Multimodal Social
  Robots
A Data-Efficient Deep Learning Approach for Deployable Multimodal Social Robots
Heriberto Cuayáhuitl
OffRL
106
19
0
27 Aug 2019
A unified representation network for segmentation with missing
  modalities
A unified representation network for segmentation with missing modalities
Kenneth Lau
J. Adler
Jens Sjölund
91
27
0
19 Aug 2019
Harmonized Multimodal Learning with Gaussian Process Latent Variable
  Models
Harmonized Multimodal Learning with Gaussian Process Latent Variable ModelsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Guoli Song
Shuhui Wang
Qingming Huang
Q. Tian
182
24
0
14 Aug 2019
Multimodal Emotion Recognition Using Deep Canonical Correlation Analysis
Multimodal Emotion Recognition Using Deep Canonical Correlation Analysis
Wei Liu
Jielin Qiu
Wei-Long Zheng
Bao-Liang Lu
153
80
0
13 Aug 2019
Deep Structured Cross-Modal Anomaly Detection
Deep Structured Cross-Modal Anomaly DetectionIEEE International Joint Conference on Neural Network (IJCNN), 2019
Yuening Li
Ninghao Liu
Jundong Li
Mengnan Du
Helen Zhou
233
18
0
11 Aug 2019
Simultaneous Semantic Segmentation and Outlier Detection in Presence of
  Domain Shift
Simultaneous Semantic Segmentation and Outlier Detection in Presence of Domain ShiftGerman Conference on Pattern Recognition (DAGM), 2019
Petra Bevandić
Ivan Kreso
Marin Orsic
Sinisa Segvic
154
86
0
03 Aug 2019
DELTA: A DEep learning based Language Technology plAtform
DELTA: A DEep learning based Language Technology plAtform
Kun Han
Junwen Chen
Hui Zhang
Haiyang Xu
Yiping Peng
...
Cheng Gong
Yunbo Wang
Wei Zou
Hui Song
Xiangang Li
VLM
98
10
0
02 Aug 2019
Machine Learning at the Network Edge: A Survey
Machine Learning at the Network Edge: A SurveyACM Computing Surveys (ACM CSUR), 2019
M. G. Sarwar Murshed
Chris Murphy
Daqing Hou
Nazar Khan
Ganesh Ananthanarayanan
Faraz Hussain
659
463
0
31 Jul 2019
Making Sense of Vision and Touch: Learning Multimodal Representations
  for Contact-Rich Tasks
Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich TasksIEEE Transactions on robotics (TRO), 2019
Michelle A. Lee
Yuke Zhu
Peter Zachares
Matthew Tan
K. Srinivasan
Silvio Savarese
Fei-Fei Li
Animesh Garg
Jeannette Bohg
SSL
248
247
0
28 Jul 2019
Production Ranking Systems: A Review
Production Ranking Systems: A Review
M. Iqbal
Nishan Subedi
Kamelia Aryafar
AI4TS
127
3
0
24 Jul 2019
Multisensory Learning Framework for Robot Drumming
Multisensory Learning Framework for Robot Drumming
A. Barsky
Claudio Zito
Hiroki Mori
Tetsuya Ogata
J. Wyatt
87
7
0
23 Jul 2019
Shared Generative Latent Representation Learning for Multi-view
  Clustering
Shared Generative Latent Representation Learning for Multi-view ClusteringAAAI Conference on Artificial Intelligence (AAAI), 2019
Ming Yin
Weitian Huang
Junbin Gao
130
75
0
23 Jul 2019
OmniNet: A unified architecture for multi-modal multi-task learning
OmniNet: A unified architecture for multi-modal multi-task learning
Subhojeet Pramanik
Priyanka Agrawal
A. Hussain
156
45
0
17 Jul 2019
retina-VAE: Variationally Decoding the Spectrum of Macular Disease
retina-VAE: Variationally Decoding the Spectrum of Macular Disease
Stephen G. Odaibo
54
3
0
11 Jul 2019
Deep Coupled-Representation Learning for Sparse Linear Inverse Problems
  with Side Information
Deep Coupled-Representation Learning for Sparse Linear Inverse Problems with Side InformationIEEE Signal Processing Letters (SPL), 2019
Evaggelia Tsiligianni
Nikos Deligiannis
165
19
0
04 Jul 2019
Probabilistic CCA with Implicit Distributions
Probabilistic CCA with Implicit Distributions
Yaxin Shi
Yuangang Pan
Donna Xu
Ivor Tsang
54
0
0
04 Jul 2019
Cascade Attention Guided Residue Learning GAN for Cross-Modal
  Translation
Cascade Attention Guided Residue Learning GAN for Cross-Modal TranslationInternational Conference on Pattern Recognition (ICPR), 2019
Bin Duan
Wei Wang
Hao Tang
Hugo Latapie
Yan Yan
321
38
0
03 Jul 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory
Deep Gamblers: Learning to Abstain with Portfolio TheoryNeural Information Processing Systems (NeurIPS), 2019
Liu Ziyin
Zhikang T. Wang
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
Masahito Ueda
301
124
0
29 Jun 2019
RFBNet: Deep Multimodal Networks with Residual Fusion Blocks for RGB-D
  Semantic Segmentation
RFBNet: Deep Multimodal Networks with Residual Fusion Blocks for RGB-D Semantic Segmentation
Liuyuan Deng
Ming Yang
Tianyi Li
Yuesheng He
Chunxiang Wang
237
70
0
29 Jun 2019
Lipper: Synthesizing Thy Speech using Multi-View Lipreading
Lipper: Synthesizing Thy Speech using Multi-View LipreadingAAAI Conference on Artificial Intelligence (AAAI), 2019
Yaman Kumar Singla
Rohit Jain
Khwaja Mohd. Salik
R. Shah
Yifang Yin
Roger Zimmermann
171
43
0
28 Jun 2019
Task-Driven Common Representation Learning via Bridge Neural Network
Task-Driven Common Representation Learning via Bridge Neural NetworkAAAI Conference on Artificial Intelligence (AAAI), 2019
Yao Xu
Xueshuang Xiang
Meiyu Huang
SSL
150
5
0
26 Jun 2019
Learning as the Unsupervised Alignment of Conceptual Systems
Learning as the Unsupervised Alignment of Conceptual SystemsNature Machine Intelligence (NMI), 2019
Brett D. Roads
Bradley C. Love
OCL
226
49
0
21 Jun 2019
Deep RGB-D Canonical Correlation Analysis For Sparse Depth Completion
Deep RGB-D Canonical Correlation Analysis For Sparse Depth CompletionNeural Information Processing Systems (NeurIPS), 2019
Yiqi Zhong
Cho-Ying Wu
Suya You
Ulrich Neumann
3DV
145
40
0
21 Jun 2019
Connecting Touch and Vision via Cross-Modal Prediction
Connecting Touch and Vision via Cross-Modal PredictionComputer Vision and Pattern Recognition (CVPR), 2019
Yunzhu Li
Jun-Yan Zhu
Russ Tedrake
Antonio Torralba
198
157
0
14 Jun 2019
Speaker-Targeted Audio-Visual Models for Speech Recognition in
  Cocktail-Party Environments
Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party EnvironmentsInterspeech (Interspeech), 2016
Guan-Lin Chao
William Chan
Ian Lane
133
15
0
13 Jun 2019
Representation Learning for Words and Entities
Representation Learning for Words and Entities
Pushpendre Rastogi
SSL
182
0
0
12 Jun 2019
Federated AI lets a team imagine together: Federated Learning of GANs
Federated AI lets a team imagine together: Federated Learning of GANsInternational Journal of Computer Science and Engineering (IJCSE), 2019
R. A
N. V
FedML
97
7
0
09 Jun 2019
Visually Grounded Neural Syntax Acquisition
Visually Grounded Neural Syntax AcquisitionAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Freda Shi
Jiayuan Mao
Kevin Gimpel
Karen Livescu
NAI
212
85
0
07 Jun 2019
Previous
123...91011...151617
Next
Page 10 of 17
Pageof 17