ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.08481
  4. Cited By
GuessWhat?! Visual object discovery through multi-modal dialogue
v1v2 (latest)

GuessWhat?! Visual object discovery through multi-modal dialogue

23 November 2016
H. D. Vries
Florian Strub
A. Chandar
Olivier Pietquin
Hugo Larochelle
Aaron Courville
    VLM
ArXiv (abs)PDFHTML

Papers citing "GuessWhat?! Visual object discovery through multi-modal dialogue"

50 / 237 papers shown
Title
Building Task-Oriented Visual Dialog Systems Through Alternative
  Optimization Between Dialog Policy and Language Generation
Building Task-Oriented Visual Dialog Systems Through Alternative Optimization Between Dialog Policy and Language GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Mingyang Zhou
Josh Arnold
Zhou Yu
OffRL
144
11
0
06 Sep 2019
"Can you say more about the location?" The Development of a Pedagogical
  Reference Resolution Agent
"Can you say more about the location?" The Development of a Pedagogical Reference Resolution Agent
Maike Paetzel
R. Manuvinakurike
48
7
0
03 Sep 2019
Grounded Agreement Games: Emphasizing Conversational Grounding in Visual
  Dialogue Settings
Grounded Agreement Games: Emphasizing Conversational Grounding in Visual Dialogue Settings
David Schlangen
107
16
0
29 Aug 2019
Zero-Shot Grounding of Objects from Natural Language Queries
Zero-Shot Grounding of Objects from Natural Language QueriesIEEE International Conference on Computer Vision (ICCV), 2019
Arka Sadhu
Kan Chen
Ram Nevatia
ObjD
203
171
0
20 Aug 2019
Towards Knowledge-Based Recommender Dialog System
Towards Knowledge-Based Recommender Dialog SystemConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Qibin Chen
Junyang Lin
Yichang Zhang
Ming Ding
Yukuo Cen
Hongxia Yang
Jie Tang
160
284
0
15 Aug 2019
What Should I Ask? Using Conversationally Informative Rewards for
  Goal-Oriented Visual Dialog
What Should I Ask? Using Conversationally Informative Rewards for Goal-Oriented Visual DialogAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Pushkar Shukla
Carlos E. L. Elmadjian
Richika Sharan
Vivek Kulkarni
Matthew Turk
William Yang Wang
164
34
0
28 Jul 2019
Learning Goal-Oriented Visual Dialog Agents: Imitating and Surpassing
  Analytic Experts
Learning Goal-Oriented Visual Dialog Agents: Imitating and Surpassing Analytic ExpertsIEEE International Conference on Multimedia and Expo (ICME), 2019
Yenchih Chang
Wen-Hsiao Peng
102
4
0
24 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and MethodsJournal of Artificial Intelligence Research (JAIR), 2019
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
352
141
0
22 Jul 2019
MeetUp! A Corpus of Joint Activity Dialogues in a Visual Environment
MeetUp! A Corpus of Joint Activity Dialogues in a Visual Environment
N. Ilinykh
Sina Zarrieß
David Schlangen
101
44
0
11 Jul 2019
A Natural Language Corpus of Common Grounding under Continuous and
  Partially-Observable Context
A Natural Language Corpus of Common Grounding under Continuous and Partially-Observable ContextAAAI Conference on Artificial Intelligence (AAAI), 2019
Takuma Udagawa
Akiko Aizawa
130
50
0
08 Jul 2019
RUBi: Reducing Unimodal Biases in Visual Question Answering
RUBi: Reducing Unimodal Biases in Visual Question AnsweringNeural Information Processing Systems (NeurIPS), 2019
Rémi Cadène
Corentin Dancette
H. Ben-younes
Matthieu Cord
Devi Parikh
CML
262
401
0
24 Jun 2019
The PhotoBook Dataset: Building Common Ground through Visually-Grounded
  Dialogue
The PhotoBook Dataset: Building Common Ground through Visually-Grounded DialogueAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
J. Haber
Tim Baumgärtner
Ece Takmaz
Lieke Gelderloos
Elia Bruni
Raquel Fernández
152
84
0
04 Jun 2019
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language
  Feedback
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback
Hui Wu
Yupeng Gao
Xiaoxiao Guo
Ziad Al-Halah
Steven J. Rennie
Kristen Grauman
Rogerio Feris
EgoV
279
75
0
30 May 2019
Show, Price and Negotiate: A Negotiator with Online Value Look-Ahead
Show, Price and Negotiate: A Negotiator with Online Value Look-Ahead
Amin Parvaneh
Ehsan Abbasnejad
Qi Wu
Javen Qinfeng Shi
Anton van den Hengel
OffRL
150
5
0
07 May 2019
Evaluating the Representational Hub of Language and Vision Models
Evaluating the Representational Hub of Language and Vision Models
Ravi Shekhar
Ece Takmaz
Raquel Fernández
Raffaella Bernardi
138
12
0
12 Apr 2019
Factor Graph Attention
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
Alex Schwing
248
110
0
11 Apr 2019
Reasoning Visual Dialogs with Structural and Partial Observations
Reasoning Visual Dialogs with Structural and Partial Observations
Zilong Zheng
Wenguan Wang
Siyuan Qi
Song-Chun Zhu
204
118
0
11 Apr 2019
Can You Explain That? Lucid Explanations Help Human-AI Collaborative
  Image Retrieval
Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval
Arijit Ray
Yi Yao
Rakesh Kumar
Ajay Divakaran
Giedrius Burachas
222
5
0
05 Apr 2019
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual
  Dialog
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog
Satwik Kottur
José M. F. Moura
Devi Parikh
Dhruv Batra
Marcus Rohrbach
176
91
0
07 Mar 2019
Generative Visual Dialogue System via Adaptive Reasoning and Weighted
  Likelihood Estimation
Generative Visual Dialogue System via Adaptive Reasoning and Weighted Likelihood Estimation
Heming Zhang
Shalini Ghosh
Larry Heck
Stephen Walsh
Junting Zhang
Jie Zhang
C.-C. Jay Kuo
203
7
0
26 Feb 2019
Image-Question-Answer Synergistic Network for Visual Dialog
Image-Question-Answer Synergistic Network for Visual DialogComputer Vision and Pattern Recognition (CVPR), 2019
Dalu Guo
Chang Xu
Dacheng Tao
134
77
0
26 Feb 2019
Making History Matter: History-Advantage Sequence Training for Visual
  Dialog
Making History Matter: History-Advantage Sequence Training for Visual Dialog
Tianhao Yang
Zhengjun Zha
Hanwang Zhang
OffRL
202
8
0
25 Feb 2019
Large-Scale Answerer in Questioner's Mind for Visual Dialog Question
  Generation
Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation
Sang-Woo Lee
Tong Gao
Sohee Yang
Jaejun Yoo
Jung-Woo Ha
127
18
0
22 Feb 2019
Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog
Multi-step Reasoning via Recurrent Dual Attention for Visual DialogAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Zhe Gan
Yu Cheng
Ahmed El Kholy
Linjie Li
Jingjing Liu
Jianfeng Gao
299
106
0
01 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
The Hanabi Challenge: A New Frontier for AI ResearchArtificial Intelligence (AI), 2019
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
398
388
0
01 Feb 2019
Multi-modal dialog for browsing large visual catalogs using
  exploration-exploitation paradigm in a joint embedding space
Multi-modal dialog for browsing large visual catalogs using exploration-exploitation paradigm in a joint embedding space
Indrani Bhattacharya
Arkabandhu Chowdhury
V. Raykar
156
6
0
28 Jan 2019
Sequential Attention GAN for Interactive Image Editing
Sequential Attention GAN for Interactive Image Editing
Yu Cheng
Zhe Gan
Yitong Li
Jingjing Liu
Jianfeng Gao
253
108
0
20 Dec 2018
From FiLM to Video: Multi-turn Question Answering with Multi-modal
  Context
From FiLM to Video: Multi-turn Question Answering with Multi-modal Context
T. Nguyen
Shikhar Sharma
Hannes Schulz
Layla El Asri
122
34
0
17 Dec 2018
Visual Dialogue without Vision or Dialogue
Visual Dialogue without Vision or Dialogue
Daniela Massiceti
P. Dokania
N. Siddharth
Juil Sock
216
34
0
16 Dec 2018
What's to know? Uncertainty as a Guide to Asking Goal-oriented Questions
What's to know? Uncertainty as a Guide to Asking Goal-oriented Questions
Ehsan Abbasnejad
Qi Wu
Javen Qinfeng Shi
Anton Van Den Hengel
96
20
0
16 Dec 2018
Gold Seeker: Information Gain from Policy Distributions for
  Goal-oriented Vision-and-Langauge Reasoning
Gold Seeker: Information Gain from Policy Distributions for Goal-oriented Vision-and-Langauge Reasoning
Ehsan Abbasnejad
Iman Abbasnejad
Qi Wu
Javen Qinfeng Shi
Anton Van Den Hengel
OffRL
196
5
0
16 Dec 2018
PIRC Net : Using Proposal Indexing, Relationships and Context for Phrase
  Grounding
PIRC Net : Using Proposal Indexing, Relationships and Context for Phrase Grounding
Rama Kovvuri
Ram Nevatia
ObjD
144
20
0
07 Dec 2018
Recursive Visual Attention in Visual Dialog
Recursive Visual Attention in Visual Dialog
Yulei Niu
Hanwang Zhang
Manli Zhang
Jianhong Zhang
Zhiwu Lu
Ji-Rong Wen
198
122
0
06 Dec 2018
A System for Automated Image Editing from Natural Language Commands
A System for Automated Image Editing from Natural Language Commands
Jacqueline Brixey
R. Manuvinakurike
Nham Le
T. Lai
W. Chang
Trung Bui
77
4
0
03 Dec 2018
FineGAN: Unsupervised Hierarchical Disentanglement for Fine-Grained
  Object Generation and Discovery
FineGAN: Unsupervised Hierarchical Disentanglement for Fine-Grained Object Generation and Discovery
Krishna Kumar Singh
Utkarsh Ojha
Yong Jae Lee
OCL
271
136
0
27 Nov 2018
Efficient Dialog Policy Learning via Positive Memory Retention
Efficient Dialog Policy Learning via Positive Memory Retention
Rui Zhao
Volker Tresp
168
10
0
02 Oct 2018
Neural Approaches to Conversational AI
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
373
713
0
21 Sep 2018
Game-Based Video-Context Dialogue
Game-Based Video-Context Dialogue
Ramakanth Pasunuru
Joey Tianyi Zhou
139
36
0
12 Sep 2018
Beyond task success: A closer look at jointly learning to see, ask, and
  GuessWhat
Beyond task success: A closer look at jointly learning to see, ask, and GuessWhat
Ravi Shekhar
Aashish Venkatesh
Tim Baumgärtner
Elia Bruni
Barbara Plank
Raffaella Bernardi
Raquel Fernández
130
51
0
10 Sep 2018
Visual Coreference Resolution in Visual Dialog using Neural Module
  Networks
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
Satwik Kottur
José M. F. Moura
Devi Parikh
Dhruv Batra
Marcus Rohrbach
186
168
0
06 Sep 2018
Learning a Policy for Opportunistic Active Learning
Learning a Policy for Opportunistic Active Learning
Aishwarya Padmakumar
Peter Stone
Raymond J. Mooney
165
22
0
29 Aug 2018
Multimodal Differential Network for Visual Question Generation
Multimodal Differential Network for Visual Question Generation
Badri N. Patro
Sandeep Kumar
V. Kurmi
Vinay P. Namboodiri
201
40
0
12 Aug 2018
Community Regularization of Visually-Grounded Dialog
Community Regularization of Visually-Grounded Dialog
Akshat Agarwal
Swaminathan Gurumurthy
Vasu Sharma
M. Lewis
Katia Sycara
133
10
0
10 Aug 2018
Visual Reasoning with Multi-hop Feature Modulation
Visual Reasoning with Multi-hop Feature Modulation
Florian Strub
Mathieu Seurin
Ethan Perez
H. D. Vries
Jérémie Mary
Philippe Preux
Aaron Courville
Olivier Pietquin
217
28
0
03 Aug 2018
Towards Understanding End-of-trip Instructions in a Taxi Ride Scenario
Towards Understanding End-of-trip Instructions in a Taxi Ride Scenario
Deepthi Karkada
R. Manuvinakurike
Kallirroi Georgila
85
0
0
11 Jul 2018
Talk the Walk: Navigating New York City through Grounded Dialogue
Talk the Walk: Navigating New York City through Grounded Dialogue
H. D. Vries
Kurt Shuster
Dhruv Batra
Devi Parikh
Jason Weston
Douwe Kiela
318
128
0
09 Jul 2018
Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient
Learning Goal-Oriented Visual Dialog via Tempered Policy GradientSpoken Language Technology Workshop (SLT), 2018
Rui Zhao
Volker Tresp
LLMAG
219
14
0
02 Jul 2018
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7
Huda AlAmri
Vincent Cartillier
Raphael Gontijo-Lopes
Abhishek Das
Jue Wang
...
Dhruv Batra
Devi Parikh
A. Cherian
Tim K. Marks
Chiori Hori
132
34
0
01 Jun 2018
Ask No More: Deciding when to guess in referential visual dialogue
Ask No More: Deciding when to guess in referential visual dialogue
Ravi Shekhar
Tim Baumgärtner
Aashish Venkatesh
Elia Bruni
Raffaella Bernardi
Raquel Fernández
135
22
0
17 May 2018
Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented
  Visual Dialog
Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog
Jiaping Zhang
Tiancheng Zhao
Zhou Yu
131
41
0
08 May 2018
Previous
12345
Next