1611.08481
GuessWhat?! Visual object discovery through multi-modal dialogue
23 November 2016
Harm de Vries
Florian Strub
A. Chandar
Olivier Pietquin
Hugo Larochelle
Aaron Courville
VLM
Papers citing
"GuessWhat?! Visual object discovery through multi-modal dialogue"
50 / 237 papers shown
Title
LEATHER: A Framework for Learning to Generate Human-like Text in Dialogue
Anthony Sicilia
Malihe Alikhani
237
6
0
14 Oct 2022
Understanding Embodied Reference with Touch-Line Transformer
International Conference on Learning Representations (ICLR), 2022
Yongqian Li
Xiaoxue Chen
Hao Zhao
Jiangtao Gong
Guyue Zhou
Federico Rossano
Yixin Zhu
254
20
0
11 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ye Zhu
Yuehua Wu
Andrii Zadaianchuk
Yan Yan
342
37
0
05 Oct 2022
Learning More May Not Be Better: Knowledge Transferability in Vision and Language Tasks
Journal of Imaging (JI), 2022
Tianwei Chen
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
Hajime Nagahara
VLM
120
1
0
23 Aug 2022
Modeling Non-Cooperative Dialogue: Theoretical and Empirical Insights
Transactions of the Association for Computational Linguistics (TACL), 2022
Anthony Sicilia
Tristan D. Maidment
Pat Healy
Malihe Alikhani
135
4
0
15 Jul 2022
Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Natural Language Processing and Chinese Computing (NLPCC), 2022
Bin Li
Yixuan Weng
Ziyu Ma
Bin Sun
Shutao Li
VLM
129
2
0
05 Jul 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review
Hao Wang
Bin Guo
Y. Zeng
Yasan Ding
Chen Qiu
Ying Zhang
Li Yao
Zhiwen Yu
229
2
0
02 Jul 2022
Multimodal Dialogue State Tracking
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Hung Le
Nancy F. Chen
Guosheng Lin
136
10
0
16 Jun 2022
FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation
Neural Information Processing Systems (NeurIPS), 2022
Mehmet Özgür Türkoglu
Alexander Becker
H. Gündüz
Mina Rezaei
B. Bischl
Rodrigo Caye Daudt
Stefano D'Aronco
Jan Dirk Wegner
Konrad Schindler
FedML
UQCV
510
33
0
31 May 2022
The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training
Computer Vision and Pattern Recognition (CVPR), 2022
Gi-Cheon Kang
Sungdong Kim
Jin-Hwa Kim
Donghyun Kwak
Byoung-Tak Zhang
274
16
0
25 May 2022
Multimodal Conversational AI: A Survey of Datasets and Approaches
Anirudh S. Sundar
Larry Heck
158
32
0
13 May 2022
Supplementing Missing Visions via Dialog for Scene Graph Generations
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhenghao Zhao
Ye Zhu
Xiaoguang Zhu
Yuzhang Shang
Yan Yan
166
1
0
23 Apr 2022
Learning to Execute Actions or Ask Clarification Questions
Zhengxiang Shi
Yue Feng
Aldo Lipani
LM&Ro
224
48
0
18 Apr 2022
Co-VQA : Answering by Interactive Sub Question Sequence
Findings (Findings), 2022
Ruonan Wang
Yuxi Qian
Fangxiang Feng
Xiaojie Wang
Huixing Jiang
LRM
156
19
0
02 Apr 2022
Image Retrieval from Contextual Descriptions
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Benno Krojer
Vaibhav Adlakha
Vibhav Vineet
Yash Goyal
Edoardo Ponti
Siva Reddy
244
38
0
29 Mar 2022
Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene
Duo Zheng
Fandong Meng
Q. Si
Hairun Fan
Zipeng Xu
Jie Zhou
Fangxiang Feng
Xiaojie Wang
158
0
0
16 Mar 2022
Vision-Language Intelligence: Tasks, Representation Learning, and Large Models
Feng Li
Hao Zhang
Yi-Fan Zhang
Shixuan Liu
Jian Guo
L. Ni
Pengchuan Zhang
Lei Zhang
AI4TS
VLM
189
40
0
03 Mar 2022
CAISE: Conversational Agent for Image Search and Editing
AAAI Conference on Artificial Intelligence (AAAI), 2022
Hyounghun Kim
Doo Soon Kim
Seunghyun Yoon
Franck Dernoncourt
Trung Bui
Joey Tianyi Zhou
204
6
0
24 Feb 2022
Multimodal Interactions Using Pretrained Unimodal Models for SIMMC 2.0
Joosung Lee
Kijong Han
185
6
0
10 Dec 2021
Channel Exchanging Networks for Multimodal and Multitask Dense Image Prediction
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Yikai Wang
Gang Hua
Wenbing Huang
Fengxiang He
Dacheng Tao
257
43
0
04 Dec 2021
Building Goal-Oriented Dialogue Systems with Situated Visual Context
AAAI Conference on Artificial Intelligence (AAAI), 2021
Sanchit Agarwal
Jan Jezabek
Arijit Biswas
Emre Barut
Shuyang Gao
Tagyoung Chung
145
1
0
22 Nov 2021
Open-domain clarification question generation without question examples
Julia White
Gabriel Poesia
Robert D. Hawkins
Dorsa Sadigh
Noah D. Goodman
95
28
0
19 Oct 2021
A Framework for Learning to Request Rich and Contextually Useful Information from Humans
Khanh Nguyen
Yonatan Bisk
Hal Daumé
452
20
0
14 Oct 2021
OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts
Shuhe Wang
Yuxian Meng
Xiaoya Li
Xiaofei Sun
Rongbin Ouyang
Jiwei Li
MLLM
VLM
211
23
0
27 Sep 2021
Learning Natural Language Generation from Scratch
Alice Martin Donati
Guillaume Quispe
Charles Ollion
Sylvain Le Corff
Florian Strub
Olivier Pietquin
LRM
131
4
0
20 Sep 2021
Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy
A. Testoni
Raffaella Bernardi
127
11
0
11 Sep 2021
We went to look for meaning and all we got were these lousy representations: aspects of meaning representation for computational semantics
Simon Dobnik
R. Cooper
Adam Ek
Bill Noble
Staffan Larsson
N. Ilinykh
Vladislav Maraev
Vidya Somashekarappa
130
0
0
10 Sep 2021
YouRefIt: Embodied Reference Understanding with Language and Gesture
IEEE International Conference on Computer Vision (ICCV), 2021
Yixin Chen
Qing Li
Deqian Kong
Yik Lun Kei
Song-Chun Zhu
Tao Gao
Yixin Zhu
Siyuan Huang
LM&Ro
223
48
0
08 Sep 2021
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Duo Zheng
Zipeng Xu
Fandong Meng
Caixia Yuan
Jiaan Wang
Jie Zhou
102
13
0
06 Sep 2021
Chest ImaGenome Dataset for Clinical Reasoning
Joy T. Wu
Nkechinyere N. Agu
Ismini Lourentzou
Arjun Sharma
J. Paguio
...
William Mitchell
Satyananda Kashyap
Andrea Giovannini
Leo Anthony Celi
Mehdi Moradi
227
89
0
31 Jul 2021
Language Grounding with 3D Objects
Conference on Robot Learning (CoRL), 2021
Jesse Thomason
Mohit Shridhar
Yonatan Bisk
Chris Paxton
Luke Zettlemoyer
LM&Ro
198
54
0
26 Jul 2021
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Bingqian Lin
Yi Zhu
Yanxin Long
Xiaodan Liang
QiXiang Ye
Liang Lin
AAML
190
20
0
23 Jul 2021
Modeling Explicit Concerning States for Reinforcement Learning in Visual Dialogue
Zipeng Xu
Fandong Meng
Caixia Yuan
Duo Zheng
Chenxu Lv
Jie Zhou
OffRL
161
6
0
12 Jul 2021
Target-dependent UNITER: A Transformer-Based Multimodal Language Comprehension Model for Domestic Service Robots
Shintaro Ishikawa
K. Sugiura
164
13
0
02 Jul 2021
Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue
IEEE International Conference on Computer Vision (ICCV), 2021
Shoya Matsumori
Kosuke Shingyouchi
Yukikoko Abe
Yosuke Fukuchi
K. Sugiura
M. Imai
168
17
0
29 Jun 2021
Saying the Unseen: Video Descriptions via Dialog Agents
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
190
8
0
26 Jun 2021
$C^3$: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues
Hung Le
Nancy F. Chen
Guosheng Lin
107
2
0
16 Jun 2021
Grounding 'Grounding' in NLP
Findings (Findings), 2021
Khyathi Chandu
Yonatan Bisk
A. Black
149
57
0
04 Jun 2021
Maintaining Common Ground in Dynamic Environments
Transactions of the Association for Computational Linguistics (TACL), 2021
Takuma Udagawa
Akiko Aizawa
160
15
0
29 May 2021
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation
Computer Vision and Pattern Recognition (CVPR), 2021
Tao Tu
Q. Ping
Govind Thattai
Gokhan Tur
Premkumar Natarajan
133
18
0
24 May 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey
Artificial Intelligence Review (AIR), 2021
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Xiaoshi Zhong
779
320
0
10 May 2021
A recipe for annotating grounded clarifications
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Luciana Benotti
P. Blackburn
82
19
0
18 Apr 2021
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Satwik Kottur
Seungwhan Moon
A. Geramifard
Babak Damavandi
220
98
0
18 Apr 2021
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems
Journal of Artificial Intelligence Research (JAIR), 2021
E. Razumovskaia
Goran Glavaš
Olga Majewska
Edoardo Ponti
Anna Korhonen
Ivan Vulić
481
37
0
17 Apr 2021
Ensemble of MRR and NDCG models for Visual Dialog
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Idan Schwartz
226
10
0
15 Apr 2021
Structured Co-reference Graph Attention for Video-grounded Dialogue
AAAI Conference on Artificial Intelligence (AAAI), 2021
Junyeong Kim
Sunjae Yoon
Dahyun Kim
Chang D. Yoo
202
29
0
24 Mar 2021
The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
A. Testoni
Raffaella Bernardi
82
4
0
20 Mar 2021
Overprotective Training Environments Fall Short at Testing Time: Let Models Contribute to Their Own Training
Italian Conference on Computational Linguistics (CLiC-it), 2021
A. Testoni
Raffaella Bernardi
99
2
0
20 Mar 2021
I Want This Product but Different : Multimodal Retrieval with Synthetic Query Expansion
Ivona Tautkute
Tomasz Trzciński
219
5
0
17 Feb 2021
Converse, Focus and Guess -- Towards Multi-Document Driven Dialogue
AAAI Conference on Artificial Intelligence (AAAI), 2021
Han Liu
Caixia Yuan
Xiaojie Wang
Yushu Yang
Huixing Jiang
Zhongyuan Wang
205
1
0
04 Feb 2021