ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.01095
  4. Cited By
MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images
  with Latent Variable Model

MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images with Latent Variable Model

2 April 2020
Han Fu
R. Wu
Chenghao Liu
Jianling Sun
ArXivPDFHTML

Papers citing "MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images with Latent Variable Model"

26 / 26 papers shown
Title
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
142
10,591
0
17 Feb 2020
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
Zihao Wang
Xihui Liu
Hongsheng Li
Lu Sheng
Junjie Yan
Xiaogang Wang
Jing Shao
VLM
37
300
0
12 Sep 2019
Visual Semantic Reasoning for Image-Text Matching
Visual Semantic Reasoning for Image-Text Matching
Kunpeng Li
Yulun Zhang
Keqin Li
Yuanyuan Li
Y. Fu
VLM
58
500
0
06 Sep 2019
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Yale Song
M. Soleymani
35
242
0
11 Jun 2019
Learning Cross-Modal Embeddings with Adversarial Networks for Cooking
  Recipes and Food Images
Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images
Hao Wang
Doyen Sahoo
Chenghao Liu
Ee-Peng Lim
Guosheng Lin
21
132
0
03 May 2019
Diagnosing and Enhancing VAE Models
Diagnosing and Enhancing VAE Models
Bin Dai
David Wipf
DRL
34
378
0
14 Mar 2019
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking
  Recipes and Food Images
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images
Javier Marín
Aritro Biswas
Ferda Ofli
Nick Hynes
Amaia Salvador
Y. Aytar
Ingmar Weber
Antonio Torralba
27
323
0
14 Oct 2018
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking
  Recipes
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes
Semih Yagcioglu
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
CoGe
41
171
0
04 Sep 2018
Cross-Modal Retrieval in the Cooking Context: Learning Semantic
  Text-Image Embeddings
Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings
Micael Carvalho
Rémi Cadène
David Picard
Laure Soulier
Nicolas Thome
Matthieu Cord
36
180
0
30 Apr 2018
Finding beans in burgers: Deep semantic-visual embedding with
  localization
Finding beans in burgers: Deep semantic-visual embedding with localization
Martin Engilberge
Louis Chevallier
P. Pérez
Matthieu Cord
32
95
0
05 Apr 2018
Stacked Cross Attention for Image-Text Matching
Stacked Cross Attention for Image-Text Matching
Kuang-Huei Lee
Xi Chen
G. Hua
Houdong Hu
Xiaodong He
56
1,148
0
21 Mar 2018
CleanNet: Transfer Learning for Scalable Image Classifier Training with
  Label Noise
CleanNet: Transfer Learning for Scalable Image Classifier Training with Label Noise
Kuang-Huei Lee
Xiaodong He
Lei Zhang
Linjun Yang
NoLa
48
454
0
20 Nov 2017
Simulating Action Dynamics with Neural Process Networks
Simulating Action Dynamics with Neural Process Networks
Antoine Bosselut
Omer Levy
Ari Holtzman
C. Ennis
Dieter Fox
Yejin Choi
MILM
AI4CE
53
120
0
14 Nov 2017
Instance-aware Image and Sentence Matching with Selective Multimodal
  LSTM
Instance-aware Image and Sentence Matching with Selective Multimodal LSTM
Yan Huang
Wei Wang
Liang Wang
37
222
0
17 Nov 2016
DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided
  Dietary Assessment
DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment
Chang Liu
Yu Cao
Yan Luo
Guanling Chen
V. Vokkarane
Yunsheng Ma
27
253
0
17 Jun 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.2K
192,638
0
10 Dec 2015
Fine-grained Image Classification by Exploring Bipartite-Graph Labels
Fine-grained Image Classification by Exploring Bipartite-Graph Labels
Feng Zhou
Yuanqing Lin
44
131
0
08 Dec 2015
Generating Sentences from a Continuous Space
Generating Sentences from a Continuous Space
Samuel R. Bowman
Luke Vilnis
Oriol Vinyals
Andrew M. Dai
Rafal Jozefowicz
Samy Bengio
DRL
69
2,352
0
19 Nov 2015
Learning Deep Structure-Preserving Image-Text Embeddings
Learning Deep Structure-Preserving Image-Text Embeddings
Liwei Wang
Yin Li
Svetlana Lazebnik
60
782
0
19 Nov 2015
Skip-Thought Vectors
Skip-Thought Vectors
Ryan Kiros
Yukun Zhu
Ruslan Salakhutdinov
R. Zemel
Antonio Torralba
R. Urtasun
Sanja Fidler
SSL
126
2,405
0
22 Jun 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
244
13,079
0
12 Mar 2015
What's Cookin'? Interpreting Cooking Videos using Text, Speech and
  Vision
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision
J. Malmaud
Jonathan Huang
V. Rathod
Nick Johnston
Andrew Rabinovich
Kevin Patrick Murphy
48
152
0
05 Mar 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
632
149,474
0
22 Dec 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
359
27,205
0
01 Sep 2014
Learning Phrase Representations using RNN Encoder-Decoder for
  Statistical Machine Translation
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
577
23,235
0
03 Jun 2014
A Multi-View Embedding Space for Modeling Internet Images, Tags, and
  their Semantics
A Multi-View Embedding Space for Modeling Internet Images, Tags, and their Semantics
Yunchao Gong
Qifa Ke
Michael Isard
Svetlana Lazebnik
3DV
113
584
0
18 Dec 2012
1