GuessWhat?! Visual object discovery through multi-modal dialogue

23 November 2016
H. D. Vries
Florian Strub
A. Chandar
Olivier Pietquin
Hugo Larochelle
Aaron Courville
    VLM

Papers citing "GuessWhat?! Visual object discovery through multi-modal dialogue"

37 / 237 papers shown
Dialog-based Interactive Image Retrieval
Xiaoxiao Guo
Hui Wu
Yu Cheng
Steven J. Rennie
Gerald Tesauro
Rogerio Feris
331
226
0
01 May 2018
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
173
8
0
27 Apr 2018
Modeling Psychotherapy Dialogues with Kernelized Hashcode Representations: A Nonparametric Information-Theoretic Approach
S. Garg
Irina Rish
Guillermo Cecchi
Palash Goyal
Sarik Ghazarian
Shuyang Gao
Greg Ver Steeg
Aram Galstyan
240
0
0
26 Apr 2018
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech
David Harwath
Galen Chuang
James R. Glass
143
60
0
09 Apr 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
207
206
0
04 Apr 2018
Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator
Pei-Hung Chung
Kuan Tung
Ching-Lun Tai
Hung-yi Lee
RALM
124
1
0
01 Apr 2018
Guide Me: Interacting with Deep Networks
Christian Rupprecht
Iro Laina
Nassir Navab
Gregory Hager
Federico Tombari
HAI
135
39
0
30 Mar 2018
A Survey on Deep Learning Toolkits and Libraries for Intelligent User Interfaces
Jan Zacharias
Michael Barz
Daniel Sonntag
VLM
160
33
0
13 Mar 2018
Discriminability objective for training descriptive captions
Ruotian Luo
Brian L. Price
Scott D. Cohen
Gregory Shakhnarovich
273
208
0
12 Mar 2018
Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual Dialog
Sang-Woo Lee
Y. Heo
Byoung-Tak Zhang
191
32
0
12 Feb 2018
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication
Jin-Hwa Kim
Nikita Kitaev
Xinlei Chen
Marcus Rohrbach
Byoung-Tak Zhang
Yuandong Tian
Dhruv Batra
Devi Parikh
DiffM VGen
165
25
0
15 Dec 2017
Examining Cooperation in Visual Dialog Models
Mircea Mironenco
D. Kianfar
Ke M. Tran
Evangelos Kanoulas
E. Gavves
111
4
0
04 Dec 2017
Interactive Reinforcement Learning for Object Grounding via Self-Talking
Yan Zhu
Shaoting Zhang
Dimitris N. Metaxas
87
8
0
02 Dec 2017
HoME: a Household Multimodal Environment
Simon Brodeur
Ethan Perez
Ankesh Anand
Florian Golemo
Luca Herranz-Celotti
Florian Strub
Jean Rouat
Hugo Larochelle
Aaron Courville
LM&Ro
193
104
0
29 Nov 2017
Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards
Junjie Zhang
Qi Wu
Chunhua Shen
Jian Zhang
Jianfeng Lu
Anton Van Den Hengel
LRM
145
29
0
21 Nov 2017
Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning
Qi Wu
Peng Wang
Chunhua Shen
Ian Reid
Anton Van Den Hengel
GAN
146
130
0
21 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
ObjD
151
142
0
17 Nov 2017
Active Learning for Visual Question Answering: An Empirical Study
Xiaoyu Lin
Devi Parikh
190
33
0
06 Nov 2017
Learning how to learn: an adaptive dialogue agent for incrementally learning visually grounded word meanings
Yanchao Yu
Arash Eshghi
Oliver Lemon
128
20
0
29 Sep 2017
Visual Reference Resolution using Attention Memory for Visual Dialog
Paul Hongsuck Seo
Andreas M. Lehrmann
Bohyung Han
Leonid Sigal
199
124
0
23 Sep 2017
A Study of AI Population Dynamics with Million-agent Reinforcement Learning
Yaodong Yang
Lantao Yu
Yiwei Bai
Jun Wang
Weinan Zhang
Ying Wen
Yong Yu
168
7
0
13 Sep 2017
Reasoning about Fine-grained Attribute Phrases using Reference Games
Jong-Chyi Su
Chenyun Wu
Huaizu Jiang
Subhransu Maji
177
16
0
29 Aug 2017
Evaluating Visual Conversational Agents via Cooperative Human-AI Games
Prithvijit Chattopadhyay
Deshraj Yadav
Viraj Prabhu
Arjun Chandrasekaran
Abhishek Das
Stefan Lee
Dhruv Batra
Devi Parikh
167
79
0
17 Aug 2017
Learning to Disambiguate by Asking Discriminative Questions
IEEE International Conference on Computer Vision (ICCV), 2017
Yining Li
Chen Huang
Xiaoou Tang
Chen Change Loy
149
22
0
09 Aug 2017
Grounding Spatio-Semantic Referring Expressions for Human-Robot Interaction
Mohit Shridhar
David Hsu
ObjD
149
21
0
18 Jul 2017
Learning Visual Reasoning Without Strong Priors
Ethan Perez
H. D. Vries
Florian Strub
Vincent Dumoulin
Aaron Courville
OOD NAI
364
63
0
10 Jul 2017
Modulating early visual processing by language
H. D. Vries
Florian Strub
Jérémie Mary
Hugo Larochelle
Olivier Pietquin
Aaron Courville
480
511
0
02 Jul 2017
Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog
Satwik Kottur
José M. F. Moura
Stefan Lee
Dhruv Batra
LLMAG
171
227
0
26 Jun 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
Neural Information Processing Systems (NeurIPS), 2017
Jiasen Lu
A. Kannan
Jianwei Yang
Devi Parikh
Dhruv Batra
BDL
163
137
0
05 Jun 2017
Emergent Communication in a Multi-Modal, Multi-Step Referential Game
Katrina Evtimova
Andrew Drozdov
Douwe Kiela
Dong Wang
242
31
0
29 May 2017
Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-in-the-Blank Image Captioning
Q. Sun
Stefan Lee
Dhruv Batra
BDL
118
43
0
24 May 2017
C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset
Aishwarya Agrawal
Aniruddha Kembhavi
Dhruv Batra
Devi Parikh
CoGe
193
80
0
26 Apr 2017
Towards Building Large Scale Multimodal Domain-Aware Conversation Systems
Amrita Saha
Mitesh Khapra
Karthik Sankaranarayanan
190
8
0
01 Apr 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
298
429
0
20 Mar 2017
End-to-end optimization of goal-driven and visually grounded dialogue systems
Florian Strub
H. D. Vries
Jérémie Mary
Bilal Piot
Aaron Courville
Olivier Pietquin
OffRL
131
140
0
15 Mar 2017
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
338
1,056
0
26 Nov 2016
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
International Journal of Computer Vision (IJCV), 2016
Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
FAtt
860
23,824
0
07 Oct 2016