Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.01095
Cited By
MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images with Latent Variable Model
2 April 2020
Han Fu
R. Wu
Chenghao Liu
Jianling Sun
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images with Latent Variable Model"
26 / 26 papers shown
Title
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
142
10,591
0
17 Feb 2020
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
Zihao Wang
Xihui Liu
Hongsheng Li
Lu Sheng
Junjie Yan
Xiaogang Wang
Jing Shao
VLM
37
300
0
12 Sep 2019
Visual Semantic Reasoning for Image-Text Matching
Kunpeng Li
Yulun Zhang
Keqin Li
Yuanyuan Li
Y. Fu
VLM
58
500
0
06 Sep 2019
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Yale Song
M. Soleymani
35
242
0
11 Jun 2019
Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images
Hao Wang
Doyen Sahoo
Chenghao Liu
Ee-Peng Lim
Guosheng Lin
21
132
0
03 May 2019
Diagnosing and Enhancing VAE Models
Bin Dai
David Wipf
DRL
34
378
0
14 Mar 2019
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images
Javier Marín
Aritro Biswas
Ferda Ofli
Nick Hynes
Amaia Salvador
Y. Aytar
Ingmar Weber
Antonio Torralba
27
323
0
14 Oct 2018
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes
Semih Yagcioglu
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
CoGe
41
171
0
04 Sep 2018
Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings
Micael Carvalho
Rémi Cadène
David Picard
Laure Soulier
Nicolas Thome
Matthieu Cord
36
180
0
30 Apr 2018
Finding beans in burgers: Deep semantic-visual embedding with localization
Martin Engilberge
Louis Chevallier
P. Pérez
Matthieu Cord
32
95
0
05 Apr 2018
Stacked Cross Attention for Image-Text Matching
Kuang-Huei Lee
Xi Chen
G. Hua
Houdong Hu
Xiaodong He
56
1,148
0
21 Mar 2018
CleanNet: Transfer Learning for Scalable Image Classifier Training with Label Noise
Kuang-Huei Lee
Xiaodong He
Lei Zhang
Linjun Yang
NoLa
48
454
0
20 Nov 2017
Simulating Action Dynamics with Neural Process Networks
Antoine Bosselut
Omer Levy
Ari Holtzman
C. Ennis
Dieter Fox
Yejin Choi
MILM
AI4CE
53
120
0
14 Nov 2017
Instance-aware Image and Sentence Matching with Selective Multimodal LSTM
Yan Huang
Wei Wang
Liang Wang
37
222
0
17 Nov 2016
DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment
Chang Liu
Yu Cao
Yan Luo
Guanling Chen
V. Vokkarane
Yunsheng Ma
27
253
0
17 Jun 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.2K
192,638
0
10 Dec 2015
Fine-grained Image Classification by Exploring Bipartite-Graph Labels
Feng Zhou
Yuanqing Lin
44
131
0
08 Dec 2015
Generating Sentences from a Continuous Space
Samuel R. Bowman
Luke Vilnis
Oriol Vinyals
Andrew M. Dai
Rafal Jozefowicz
Samy Bengio
DRL
69
2,352
0
19 Nov 2015
Learning Deep Structure-Preserving Image-Text Embeddings
Liwei Wang
Yin Li
Svetlana Lazebnik
60
782
0
19 Nov 2015
Skip-Thought Vectors
Ryan Kiros
Yukun Zhu
Ruslan Salakhutdinov
R. Zemel
Antonio Torralba
R. Urtasun
Sanja Fidler
SSL
126
2,405
0
22 Jun 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
244
13,079
0
12 Mar 2015
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision
J. Malmaud
Jonathan Huang
V. Rathod
Nick Johnston
Andrew Rabinovich
Kevin Patrick Murphy
48
152
0
05 Mar 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
632
149,474
0
22 Dec 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
359
27,205
0
01 Sep 2014
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
577
23,235
0
03 Jun 2014
A Multi-View Embedding Space for Modeling Internet Images, Tags, and their Semantics
Yunchao Gong
Qifa Ke
Michael Isard
Svetlana Lazebnik
3DV
113
584
0
18 Dec 2012
1