ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.06553
  4. Cited By
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking
  Recipes and Food Images

Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images

14 October 2018
Javier Marín
Aritro Biswas
Ferda Ofli
Nick Hynes
Amaia Salvador
Y. Aytar
Ingmar Weber
Antonio Torralba
ArXivPDFHTML

Papers citing "Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images"

27 / 27 papers shown
Title
AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement
AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement
Shivam Singh
Karthik Swaminathan
Nabanita Dash
Ramandeep Singh
Snehasis Banerjee
Mohan Sridharan
Madhava Krishna
LLMAG
LM&Ro
105
0
0
04 Feb 2025
WineGraph: A Graph Representation For Food-Wine Pairing
WineGraph: A Graph Representation For Food-Wine Pairing
Zuzanna Gawrysiak
Agata .Zywot
Agnieszka Ławrynowicz
19
2
0
27 Jun 2024
Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food
  Detection
Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection
Pengfei Zhou
Weiqing Min
Jiajun Song
Yang Zhang
Shuqiang Jiang
27
10
0
14 Feb 2024
Food Recommendation as Language Processing (F-RLP): A Personalized and
  Contextual Paradigm
Food Recommendation as Language Processing (F-RLP): A Personalized and Contextual Paradigm
Ali Rostami
Ramesh C. Jain
Amir M. Rahmani
30
1
0
12 Feb 2024
Cultural Adaptation of Recipes
Cultural Adaptation of Recipes
Yong Cao
Yova Kementchedjhieva
Ruixiang Cui
Antonia Karamolegkou
Li Zhou
Megan Dare
Lucia Donatelli
Daniel Hershcovich
18
5
0
26 Oct 2023
KitchenScale: Learning to predict ingredient quantities from recipe
  contexts
KitchenScale: Learning to predict ingredient quantities from recipe contexts
Donghee Choi
Mogan Gim
Samy Badreddine
Hajung Kim
Donghyeon Park
Jaewoo Kang
18
6
0
21 Apr 2023
Learning to Substitute Ingredients in Recipes
Learning to Substitute Ingredients in Recipes
Bahare Fatemi
Quentin Duval
Rohit Girdhar
M. Drozdzal
Adriana Romero Soriano
20
7
0
15 Feb 2023
GLAMI-1M: A Multilingual Image-Text Fashion Dataset
GLAMI-1M: A Multilingual Image-Text Fashion Dataset
Vaclav Kosar
A. Hoskovec
Milan Šulc
Radek Bartyzal
VLM
29
3
0
17 Nov 2022
Human in the loop approaches in multi-modal conversational task guidance
  system development
Human in the loop approaches in multi-modal conversational task guidance system development
R. Manuvinakurike
Sovan Biswas
G. Raffa
R. Beckwith
A. Rhodes
Meng Shi
Gesem Gudino Mejia
Saurav Sahay
L. Nachman
29
2
0
03 Nov 2022
Task Tree Retrieval for Robotic Cooking
Task Tree Retrieval for Robotic Cooking
Sandeep Bondalapati
16
0
0
03 Nov 2022
QMRNet: Quality Metric Regression for EO Image Quality Assessment and
  Super-Resolution
QMRNet: Quality Metric Regression for EO Image Quality Assessment and Super-Resolution
David Berga
Pau Gallés
K. Takáts
Eva Mohedano
Laura Riordan-Chen
Clara Garcia-Moll
David Vilaseca
Javier Marín
SupR
22
2
0
12 Oct 2022
Towards the Creation of a Nutrition and Food Group Based Image Database
Towards the Creation of a Nutrition and Food Group Based Image Database
Zeman Shao
Jiangpeng He
Yaohui Yu
Luotao Lin
Alexandra E Cowan
H. Eicher-Miller
Fengqing M Zhu
14
6
0
05 Jun 2022
Task2Dial: A Novel Task and Dataset for Commonsense enhanced Task-based
  Dialogue Grounded in Documents
Task2Dial: A Novel Task and Dataset for Commonsense enhanced Task-based Dialogue Grounded in Documents
Carl Strathearn
Dimitra Gkatzia
30
8
0
03 Apr 2022
Should I take a walk? Estimating Energy Expenditure from Video Data
Should I take a walk? Estimating Energy Expenditure from Video Data
Kunyu Peng
Alina Roitberg
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
11
4
0
01 Feb 2022
Turath-150K: Image Database of Arab Heritage
Turath-150K: Image Database of Arab Heritage
Dani Kiyasseh
Rasheed el-Bouri
13
0
0
01 Jan 2022
Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval
  with Deep Feature Engineering
Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering
Zhongwei Xie
Ling Liu
Yanzhao Wu
Luo Zhong
Lin Li
6
23
0
22 Oct 2021
Large Scale Visual Food Recognition
Large Scale Visual Food Recognition
Weiqing Min
Zhiling Wang
Yuxin Liu
Mengjia Luo
Lijuan Kang
Xiaoming Wei
Xiaolin K. Wei
Shuqiang Jiang
29
139
0
30 Mar 2021
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers
  and Self-supervised Learning
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
Amaia Salvador
Erhan Gundogdu
Loris Bazzani
M. Donoser
ViT
10
57
0
24 Mar 2021
Nutrition5k: Towards Automatic Nutritional Understanding of Generic Food
Nutrition5k: Towards Automatic Nutritional Understanding of Generic Food
Quin Thames
Arjun Karpur
W. Norris
Fangting Xia
Liviu Panait
Tobias Weyand
Jack Sim
21
83
0
04 Mar 2021
MultiSubs: A Large-scale Multimodal and Multilingual Dataset
MultiSubs: A Large-scale Multimodal and Multilingual Dataset
Josiah Wang
Pranava Madhyastha
J. Figueiredo
Chiraag Lala
Lucia Specia
VGen
14
11
0
02 Mar 2021
CHEF: Cross-modal Hierarchical Embeddings for Food Domain Retrieval
CHEF: Cross-modal Hierarchical Embeddings for Food Domain Retrieval
Hai Xuan Pham
Ricardo Guerrero
Jiatong Li
Vladimir Pavlovic
11
20
0
04 Feb 2021
Survey on Deep Multi-modal Data Analytics: Collaboration, Rivalry and
  Fusion
Survey on Deep Multi-modal Data Analytics: Collaboration, Rivalry and Fusion
Yang Wang
33
195
0
15 Jun 2020
Unsupervised Adversarial Image Inpainting
Unsupervised Adversarial Image Inpainting
Arthur Pajot
Emmanuel de Bézenac
Patrick Gallinari
SSL
GAN
6
8
0
18 Dec 2019
When Segmentation is Not Enough: Rectifying Visual-Volume Discordance
  Through Multisensor Depth-Refined Semantic Segmentation for Food Intake
  Tracking in Long-Term Care
When Segmentation is Not Enough: Rectifying Visual-Volume Discordance Through Multisensor Depth-Refined Semantic Segmentation for Food Intake Tracking in Long-Term Care
Kaylen J. Pfisterer
Robert Amelard
A. Chung
Braeden Syrnyk
Alexander MacLean
Heather H. Keller
A. Wong
22
19
0
24 Oct 2019
MMED: A Multi-domain and Multi-modality Event Dataset
MMED: A Multi-domain and Multi-modality Event Dataset
Zhenguo Yang
Zehang Lin
Min Cheng
Qing Li
Wenyin Liu
26
9
0
04 Apr 2019
Learning Shared Semantic Space with Correlation Alignment for
  Cross-modal Event Retrieval
Learning Shared Semantic Space with Correlation Alignment for Cross-modal Event Retrieval
Zhenguo Yang
Zehang Lin
Peipei Kang
Jianming Lv
Qing Li
Wenyin Liu
3DPC
57
26
0
14 Jan 2019
Inverse Cooking: Recipe Generation from Food Images
Inverse Cooking: Recipe Generation from Food Images
Amaia Salvador
M. Drozdzal
Xavier Giró-i-Nieto
Adriana Romero
16
147
0
14 Dec 2018
1