ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.04856
  4. Cited By
Multimodal Deep Learning

Multimodal Deep Learning

International Conference on Machine Learning (ICML), 2011
12 January 2023
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
Mingyu Kim
Christopher Marquardt
Marco Moldovan
Nadja Sauter
Juhan Nam
Rickmer Schulte
Karol Urbanczyk
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Yi Men
ArXiv (abs)PDFHTML

Papers citing "Multimodal Deep Learning"

50 / 844 papers shown
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic
  Similarity Judgments for MS-COCO
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCOConference of the European Chapter of the Association for Computational Linguistics (EACL), 2020
Zarana Parekh
Jason Baldridge
Daniel Cer
Austin Waters
Yinfei Yang
288
68
0
30 Apr 2020
Multimodal Routing: Improving Local and Global Interpretability of
  Multimodal Language Analysis
Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis
Yifan Hao
Martin Q. Ma
Muqiao Yang
Ruslan Salakhutdinov
Louis-Philippe Morency
146
4
0
29 Apr 2020
EmbraceNet for Activity: A Deep Multimodal Fusion Architecture for
  Activity Recognition
EmbraceNet for Activity: A Deep Multimodal Fusion Architecture for Activity Recognition
Jun-Ho Choi
Jong-Seok Lee
72
22
0
29 Apr 2020
Cross-modal Speaker Verification and Recognition: A Multilingual
  Perspective
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
348
36
0
28 Apr 2020
Deep Auto-Encoders with Sequential Learning for Multimodal Dimensional
  Emotion Recognition
Deep Auto-Encoders with Sequential Learning for Multimodal Dimensional Emotion RecognitionIEEE transactions on multimedia (TMM), 2020
Dung Nguyen
D. Nguyen
Rui Zeng
Thanh Thi Nguyen
Son N. Tran
Thin Nguyen
Sridha Sridharan
Clinton Fookes
124
57
0
28 Apr 2020
Data-driven Flood Emulation: Speeding up Urban Flood Predictions by Deep
  Convolutional Neural Networks
Data-driven Flood Emulation: Speeding up Urban Flood Predictions by Deep Convolutional Neural NetworksJournal of Flood Risk Management (JFRM), 2020
Zifeng Guo
J. Leitão
N. Simões
V. Moosavi
AI4CE
106
146
0
17 Apr 2020
How to Teach DNNs to Pay Attention to the Visual Modality in Speech
  Recognition
How to Teach DNNs to Pay Attention to the Visual Modality in Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
George Sterpu
Christian Saam
N. Harte
198
33
0
17 Apr 2020
Sound of Guns: Digital Forensics of Gun Audio Samples meets Artificial
  Intelligence
Sound of Guns: Digital Forensics of Gun Audio Samples meets Artificial IntelligenceMultimedia tools and applications (MTA), 2020
Simone Raponi
I. M. Ali
Gabriele Oligeri
129
32
0
15 Apr 2020
Composite Travel Generative Adversarial Networks for Tabular and
  Sequential Population Synthesis
Composite Travel Generative Adversarial Networks for Tabular and Sequential Population Synthesis
Godwin Badu-Marfo
Bilal Farooq
Zachary Patterson
112
36
0
15 Apr 2020
Analysis of Social Media Data using Multimodal Deep Learning for
  Disaster Response
Analysis of Social Media Data using Multimodal Deep Learning for Disaster ResponseInternational Conference on Information Systems for Crisis Response and Management (ISCRAM), 2020
Ferda Ofli
Firoj Alam
Muhammad Imran
140
123
0
14 Apr 2020
Multimodal Categorization of Crisis Events in Social Media
Multimodal Categorization of Crisis Events in Social MediaComputer Vision and Pattern Recognition (CVPR), 2020
Mahdi Abavisani
Liwei Wu
Shengli Hu
Joel R. Tetreault
A. Jaimes
289
113
0
10 Apr 2020
Deep Multimodal Feature Encoding for Video Ordering
Deep Multimodal Feature Encoding for Video Ordering
Vivek Sharma
Makarand Tapaswi
Rainer Stiefelhagen
173
11
0
05 Apr 2020
Multimodal Material Classification for Robots using Spectroscopy and
  High Resolution Texture Imaging
Multimodal Material Classification for Robots using Spectroscopy and High Resolution Texture ImagingIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
Zackory M. Erickson
Eliot Xing
Bharat Srirangam
Sonia Chernova
Charles C. Kemp
255
45
0
02 Apr 2020
Mapping individual differences in cortical architecture using multi-view
  representation learning
Mapping individual differences in cortical architecture using multi-view representation learningIEEE International Joint Conference on Neural Network (IJCNN), 2020
A. Sellami
Franccois-Xavier Dupé
Bastien Cagna
Hachem Kadri
Stéphane Ayache
Thierry Artières
S. Takerkart
142
10
0
01 Apr 2020
Shared Cross-Modal Trajectory Prediction for Autonomous Driving
Shared Cross-Modal Trajectory Prediction for Autonomous DrivingComputer Vision and Pattern Recognition (CVPR), 2020
Chiho Choi
Joon Hee Choi
Srikanth Malla
Jiachen Li
489
73
0
01 Apr 2020
Knowledge as Priors: Cross-Modal Knowledge Generalization for Datasets
  without Superior Knowledge
Knowledge as Priors: Cross-Modal Knowledge Generalization for Datasets without Superior KnowledgeComputer Vision and Pattern Recognition (CVPR), 2020
Long Zhao
Xi Peng
Yuxiao Chen
Mubbasir Kapadia
Dimitris N. Metaxas
261
68
0
01 Apr 2020
Fashion Meets Computer Vision: A Survey
Fashion Meets Computer Vision: A SurveyACM Computing Surveys (ACM CSUR), 2020
Wen-Huang Cheng
Sijie Song
Chieh-Yun Chen
S. Hidayati
Jiaying Liu
AI4TS
292
108
0
31 Mar 2020
Integrating Physiological Time Series and Clinical Notes with Deep
  Learning for Improved ICU Mortality Prediction
Integrating Physiological Time Series and Clinical Notes with Deep Learning for Improved ICU Mortality Prediction
Satya Narayan Shukla
Benjamin M. Marlin
141
16
0
24 Mar 2020
Variational Inference for Deep Probabilistic Canonical Correlation
  Analysis
Variational Inference for Deep Probabilistic Canonical Correlation Analysis
Mahdi Karami
Dale Schuurmans
BDL
149
4
0
09 Mar 2020
Adversarial Multimodal Representation Learning for Click-Through Rate
  Prediction
Adversarial Multimodal Representation Learning for Click-Through Rate PredictionThe Web Conference (WWW), 2020
Xiang Li
Chao Wang
Jiwei Tan
Xiaoyi Zeng
Dan Ou
Bo Zheng
129
59
0
07 Mar 2020
Deep Multi-Modal Sets
Deep Multi-Modal Sets
A. Reiter
Menglin Jia
Pu Yang
Ser-Nam Lim
BDL
222
4
0
03 Mar 2020
A Semi-supervised Graph Attentive Network for Financial Fraud Detection
A Semi-supervised Graph Attentive Network for Financial Fraud DetectionIndustrial Conference on Data Mining (IDM), 2019
Daixin Wang
J. Lin
Peng Cui
Quanhui Jia
Zhen Wang
Yanming Fang
Quan Yu
Jun Zhou
Shuang Yang
Yuan Qi
GNN
186
427
0
28 Feb 2020
RMP-SNN: Residual Membrane Potential Neuron for Enabling Deeper
  High-Accuracy and Low-Latency Spiking Neural Network
RMP-SNN: Residual Membrane Potential Neuron for Enabling Deeper High-Accuracy and Low-Latency Spiking Neural NetworkComputer Vision and Pattern Recognition (CVPR), 2020
Bing Han
G. Srinivasan
Kaushik Roy
271
376
0
25 Feb 2020
Real-time Fusion Network for RGB-D Semantic Segmentation Incorporating
  Unexpected Obstacle Detection for Road-driving Images
Real-time Fusion Network for RGB-D Semantic Segmentation Incorporating Unexpected Obstacle Detection for Road-driving ImagesIEEE Robotics and Automation Letters (RA-L), 2020
Lei Sun
Kailun Yang
Xinxin Hu
Weijian Hu
Kaiwei Wang
SSeg
286
155
0
24 Feb 2020
AutoFoley: Artificial Synthesis of Synchronized Sound Tracks for Silent
  Videos with Deep Learning
AutoFoley: Artificial Synthesis of Synchronized Sound Tracks for Silent Videos with Deep LearningIEEE transactions on multimedia (TMM), 2020
Sanchita Ghose
John J. Prevost
VGen
180
50
0
21 Feb 2020
Neural Attentive Multiview Machines
Neural Attentive Multiview MachinesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Oren Barkan
Ori Katz
Noam Koenigstein
HAI
127
22
0
18 Feb 2020
Learning Robust Representations via Multi-View Information Bottleneck
Learning Robust Representations via Multi-View Information BottleneckInternational Conference on Learning Representations (ICLR), 2020
Marco Federici
Anjan Dutta
Patrick Forré
Nate Kushman
Zeynep Akata
SLR
255
309
0
17 Feb 2020
Hi-Net: Hybrid-fusion Network for Multi-modal MR Image Synthesis
Hi-Net: Hybrid-fusion Network for Multi-modal MR Image SynthesisIEEE Transactions on Medical Imaging (TMI), 2020
Tao Zhou
Huazhu Fu
Geng Chen
Jianbing Shen
Ling Shao
MedIm
370
325
0
11 Feb 2020
Audiovisual SlowFast Networks for Video Recognition
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
630
232
0
23 Jan 2020
Multimodal Deep Unfolding for Guided Image Super-Resolution
Multimodal Deep Unfolding for Guided Image Super-ResolutionIEEE Transactions on Image Processing (TIP), 2020
Iman Marivani
Evaggelia Tsiligianni
Bruno Cornelis
Nikos Deligiannis
SupR
204
50
0
21 Jan 2020
A multimodal deep learning approach for named entity recognition from
  social media
A multimodal deep learning approach for named entity recognition from social media
M. Asgari-Chenaghlu
M. Feizi-Derakhshi
Leili Farzinvash
M. Balafar
C. Motamed
282
36
0
19 Jan 2020
Deep Audio-Visual Learning: A Survey
Deep Audio-Visual Learning: A SurveyInternational Journal of Automation and Computing (IJAC), 2020
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
223
178
0
14 Jan 2020
Improved Robust ASR for Social Robots in Public Spaces
Improved Robust ASR for Social Robots in Public Spaces
Charles Jankowski
Vishwas Mruthyunjaya
Ruixi Lin
VLM
87
3
0
14 Jan 2020
Multiview Representation Learning for a Union of Subspaces
Multiview Representation Learning for a Union of Subspaces
Nils Holzenberger
R. Arora
89
1
0
30 Dec 2019
Learning from Learning Machines: Optimisation, Rules, and Social Norms
Learning from Learning Machines: Optimisation, Rules, and Social Norms
Travis LaCroix
Yoshua Bengio
85
7
0
29 Dec 2019
Pathomic Fusion: An Integrated Framework for Fusing Histopathology and
  Genomic Features for Cancer Diagnosis and Prognosis
Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and PrognosisIEEE Transactions on Medical Imaging (TMI), 2019
Richard J. Chen
Ming Y. Lu
Jingwen Wang
Drew F. K. Williamson
S. Rodig
N. Lindeman
Faisal Mahmood
377
545
0
18 Dec 2019
Multimodal Self-Supervised Learning for Medical Image Analysis
Multimodal Self-Supervised Learning for Medical Image AnalysisInformation Processing in Medical Imaging (IPMI), 2019
Aiham Taleb
Christoph Lippert
T. Klein
Moin Nabi
SSL
353
122
0
11 Dec 2019
Multimodal Generative Models for Compositional Representation Learning
Multimodal Generative Models for Compositional Representation Learning
Mike Wu
Noah D. Goodman
GANDRL
210
20
0
11 Dec 2019
Self-Supervised Learning of Video-Induced Visual Invariances
Self-Supervised Learning of Video-Induced Visual InvariancesComputer Vision and Pattern Recognition (CVPR), 2019
Michael Tschannen
Josip Djolonga
Marvin Ritter
Aravindh Mahendran
Xiaohua Zhai
N. Houlsby
Sylvain Gelly
Mario Lucic
SSL
370
65
0
05 Dec 2019
See and Read: Detecting Depression Symptoms in Higher Education Students
  Using Multimodal Social Media Data
See and Read: Detecting Depression Symptoms in Higher Education Students Using Multimodal Social Media DataInternational Conference on Web and Social Media (ICWSM), 2019
Paulo Mann
A. Paes
Elton H. Matsushima
208
44
0
03 Dec 2019
Dividing and Conquering Cross-Modal Recipe Retrieval: from Nearest
  Neighbours Baselines to SoTA
Dividing and Conquering Cross-Modal Recipe Retrieval: from Nearest Neighbours Baselines to SoTA
Mikhail Fain
Niall Twomey
Andrey Ponikar
Ryan Fox
Danushka Bollegala
242
20
0
28 Nov 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Self-Supervised Learning by Cross-Modal Audio-Video ClusteringNeural Information Processing Systems (NeurIPS), 2019
Humam Alwassel
D. Mahajan
Bruno Korbar
Lorenzo Torresani
Guohao Li
Du Tran
SSL
503
461
0
28 Nov 2019
MMTM: Multimodal Transfer Module for CNN Fusion
MMTM: Multimodal Transfer Module for CNN FusionComputer Vision and Pattern Recognition (CVPR), 2019
Hamid Reza Vaezi Joze
Amirreza Shaban
Michael L. Iuzzolino
K. Koishida
407
350
0
20 Nov 2019
Modal-aware Features for Multimodal Hashing
Modal-aware Features for Multimodal Hashing
Haien Zeng
Hanjiang Lai
Hanlu Chu
Yong Tang
Jian Yin
160
0
0
19 Nov 2019
VLUC: An Empirical Benchmark for Video-Like Urban Computing on Citywide
  Crowd and Traffic Prediction
VLUC: An Empirical Benchmark for Video-Like Urban Computing on Citywide Crowd and Traffic Prediction
Renhe Jiang
Zekun Cai
Zhaonan Wang
Chuang Yang
Z. Fan
Xuan Song
Kota Tsubouchi
Ryosuke Shibasaki
AI4TS
111
11
0
16 Nov 2019
Towards Pose-invariant Lip-Reading
Towards Pose-invariant Lip-ReadingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Shiyang Cheng
Pingchuan Ma
Georgios Tzimiropoulos
Stavros Petridis
Adrian Bulat
Jie Shen
Maja Pantic
270
32
0
14 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion,
  and Applications
Multimodal Intelligence: Representation Learning, Information Fusion, and ApplicationsIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2019
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAIAI4TS
325
408
0
10 Nov 2019
Adaptive Fusion Techniques for Multimodal Data
Adaptive Fusion Techniques for Multimodal Data
Gaurav Sahu
Olga Vechtomova
140
16
0
10 Nov 2019
Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep
  Generative Models
Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative ModelsNeural Information Processing Systems (NeurIPS), 2019
Yuge Shi
Siddharth Narayanaswamy
Brooks Paige
Juil Sock
DRL
255
324
0
08 Nov 2019
Towards a General Model of Knowledge for Facial Analysis by Multi-Source
  Transfer Learning
Towards a General Model of Knowledge for Facial Analysis by Multi-Source Transfer Learning
Valentin Vielzeuf
Alexis Lechervy
S. Pateux
F. Jurie
CVBM
164
5
0
08 Nov 2019
Previous
123...8910...151617
Next
Page 9 of 17
Pageof 17