Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.08481
Cited By
GuessWhat?! Visual object discovery through multi-modal dialogue
23 November 2016
H. D. Vries
Florian Strub
A. Chandar
Olivier Pietquin
Hugo Larochelle
Aaron Courville
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GuessWhat?! Visual object discovery through multi-modal dialogue"
50 / 232 papers shown
Title
Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Bin Li
Yixuan Weng
Ziyu Ma
Bin Sun
Shutao Li
VLM
11
2
0
05 Jul 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review
Hao Wang
Bin Guo
Y. Zeng
Yasan Ding
Chen Qiu
Ying Zhang
Li Yao
Zhiwen Yu
30
2
0
02 Jul 2022
Multimodal Dialogue State Tracking
Hung Le
Nancy F. Chen
S. Hoi
28
9
0
16 Jun 2022
FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation
Mehmet Özgür Türkoglu
Alexander Becker
H. Gündüz
Mina Rezaei
Bernd Bischl
Rodrigo Caye Daudt
Stefano Dáronco
Jan Dirk Wegner
Konrad Schindler
FedML
UQCV
38
25
0
31 May 2022
The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training
Gi-Cheon Kang
Sungdong Kim
Jin-Hwa Kim
Donghyun Kwak
Byoung-Tak Zhang
24
10
0
25 May 2022
Multimodal Conversational AI: A Survey of Datasets and Approaches
Anirudh S. Sundar
Larry Heck
30
29
0
13 May 2022
Supplementing Missing Visions via Dialog for Scene Graph Generations
Zhenghao Zhao
Ye Zhu
Xiaoguang Zhu
Yuzhang Shang
Yan Yan
29
1
0
23 Apr 2022
Learning to Execute Actions or Ask Clarification Questions
Zhengxiang Shi
Yue Feng
Aldo Lipani
LM&Ro
16
44
0
18 Apr 2022
Co-VQA : Answering by Interactive Sub Question Sequence
Ruonan Wang
Yuxi Qian
Fangxiang Feng
Xiaojie Wang
Huixing Jiang
LRM
21
16
0
02 Apr 2022
Image Retrieval from Contextual Descriptions
Benno Krojer
Vaibhav Adlakha
Vibhav Vineet
Yash Goyal
E. Ponti
Siva Reddy
13
29
0
29 Mar 2022
Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene
Duo Zheng
Fandong Meng
Q. Si
Hairun Fan
Zipeng Xu
Jie Zhou
Fangxiang Feng
Xiaojie Wang
19
0
0
16 Mar 2022
Vision-Language Intelligence: Tasks, Representation Learning, and Large Models
Feng Li
Hao Zhang
Yi-Fan Zhang
S. Liu
Jian Guo
L. Ni
Pengchuan Zhang
Lei Zhang
AI4TS
VLM
24
36
0
03 Mar 2022
CAISE: Conversational Agent for Image Search and Editing
Hyounghun Kim
Doo Soon Kim
Seunghyun Yoon
Franck Dernoncourt
Trung Bui
Mohit Bansal
19
6
0
24 Feb 2022
Multimodal Interactions Using Pretrained Unimodal Models for SIMMC 2.0
Joosung Lee
Kijong Han
37
6
0
10 Dec 2021
Channel Exchanging Networks for Multimodal and Multitask Dense Image Prediction
Yikai Wang
Fuchun Sun
Wenbing Huang
Fengxiang He
Dacheng Tao
46
29
0
04 Dec 2021
Building Goal-Oriented Dialogue Systems with Situated Visual Context
Sanchit Agarwal
Jan Jezabek
Arijit Biswas
Emre Barut
Shuyang Gao
Tagyoung Chung
18
1
0
22 Nov 2021
Open-domain clarification question generation without question examples
Julia White
Gabriel Poesia
Robert D. Hawkins
Dorsa Sadigh
Noah D. Goodman
22
23
0
19 Oct 2021
A Framework for Learning to Request Rich and Contextually Useful Information from Humans
Khanh Nguyen
Yonatan Bisk
Hal Daumé
36
16
0
14 Oct 2021
OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts
Shuhe Wang
Yuxian Meng
Xiaoya Li
Xiaofei Sun
Rongbin Ouyang
Jiwei Li
MLLM
VLM
27
21
0
27 Sep 2021
Learning Natural Language Generation from Scratch
Alice Martin Donati
Guillaume Quispe
Charles Ollion
Sylvain Le Corff
Florian Strub
Olivier Pietquin
LRM
18
4
0
20 Sep 2021
Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy
A. Testoni
Raffaella Bernardi
16
11
0
11 Sep 2021
We went to look for meaning and all we got were these lousy representations: aspects of meaning representation for computational semantics
Simon Dobnik
R. Cooper
Adam Ek
Bill Noble
Staffan Larsson
N. Ilinykh
Vladislav Maraev
Vidya Somashekarappa
19
0
0
10 Sep 2021
YouRefIt: Embodied Reference Understanding with Language and Gesture
Yixin Chen
Qing Li
Deqian Kong
Yik Lun Kei
Song-Chun Zhu
Tao Gao
Yixin Zhu
Siyuan Huang
LM&Ro
37
41
0
08 Sep 2021
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser
Duo Zheng
Zipeng Xu
Fandong Meng
Xiaojie Wang
Jiaan Wang
Jie Zhou
18
12
0
06 Sep 2021
Chest ImaGenome Dataset for Clinical Reasoning
Joy T. Wu
Nkechinyere N. Agu
Ismini Lourentzou
Arjun Sharma
J. Paguio
...
William Mitchell
Satyananda Kashyap
Andrea Giovannini
L. A. Celi
Mehdi Moradi
16
64
0
31 Jul 2021
Language Grounding with 3D Objects
Jesse Thomason
Mohit Shridhar
Yonatan Bisk
Chris Paxton
Luke Zettlemoyer
LM&Ro
12
52
0
26 Jul 2021
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation
Bingqian Lin
Yi Zhu
Yanxin Long
Xiaodan Liang
QiXiang Ye
Liang Lin
AAML
39
16
0
23 Jul 2021
Modeling Explicit Concerning States for Reinforcement Learning in Visual Dialogue
Zipeng Xu
Fandong Meng
Xiaojie Wang
Duo Zheng
Chenxu Lv
Jie Zhou
OffRL
28
5
0
12 Jul 2021
Target-dependent UNITER: A Transformer-Based Multimodal Language Comprehension Model for Domestic Service Robots
Shintaro Ishikawa
K. Sugiura
23
10
0
02 Jul 2021
Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue
Shoya Matsumori
Kosuke Shingyouchi
Yukikoko Abe
Yosuke Fukuchi
K. Sugiura
M. Imai
31
16
0
29 Jun 2021
Saying the Unseen: Video Descriptions via Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
22
6
0
26 Jun 2021
C
3
C^3
C
3
: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues
Hung Le
Nancy F. Chen
S. Hoi
16
2
0
16 Jun 2021
Grounding 'Grounding' in NLP
Khyathi Raghavi Chandu
Yonatan Bisk
A. Black
22
51
0
04 Jun 2021
Maintaining Common Ground in Dynamic Environments
Takuma Udagawa
Akiko Aizawa
19
11
0
29 May 2021
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation
Tao Tu
Q. Ping
Govind Thattai
Gökhan Tür
Premkumar Natarajan
26
18
0
24 May 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Erik Cambria
54
267
0
10 May 2021
A recipe for annotating grounded clarifications
Luciana Benotti
P. Blackburn
26
17
0
18 Apr 2021
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Satwik Kottur
Seungwhan Moon
A. Geramifard
Babak Damavandi
16
87
0
18 Apr 2021
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems
E. Razumovskaia
Goran Glavavs
Olga Majewska
E. Ponti
Anna Korhonen
Ivan Vulić
18
32
0
17 Apr 2021
Ensemble of MRR and NDCG models for Visual Dialog
Idan Schwartz
24
7
0
15 Apr 2021
Structured Co-reference Graph Attention for Video-grounded Dialogue
Junyeong Kim
Sunjae Yoon
Dahyun Kim
Chang-Dong Yoo
18
26
0
24 Mar 2021
The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues
A. Testoni
Raffaella Bernardi
15
4
0
20 Mar 2021
Overprotective Training Environments Fall Short at Testing Time: Let Models Contribute to Their Own Training
A. Testoni
Raffaella Bernardi
20
2
0
20 Mar 2021
I Want This Product but Different : Multimodal Retrieval with Synthetic Query Expansion
Ivona Tautkute
Tomasz Trzciñski
21
4
0
17 Feb 2021
Converse, Focus and Guess -- Towards Multi-Document Driven Dialogue
Han Liu
Caixia Yuan
Xiaojie Wang
Yushu Yang
Huixing Jiang
Zhongyuan Wang
27
1
0
04 Feb 2021
An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games
Alessandro Suglia
Yonatan Bisk
Ioannis Konstas
Antonio Vergari
E. Bastianelli
Andrea Vanzo
Oliver Lemon
18
8
0
31 Jan 2021
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks
Hongyin Luo
Shang-Wen Li
James R. Glass
16
9
0
24 Jan 2021
DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
Hung Le
Chinnadhurai Sankar
Seungwhan Moon
Ahmad Beirami
A. Geramifard
Satwik Kottur
VGen
24
18
0
01 Jan 2021
OpenViDial: A Large-Scale, Open-Domain Dialogue Dataset with Visual Contexts
Yuxian Meng
Shuhe Wang
Qinghong Han
Xiaofei Sun
Fei Wu
Rui Yan
Jiwei Li
21
28
0
30 Dec 2020
Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs
Emanuele Bugliarello
Ryan Cotterell
Naoaki Okazaki
Desmond Elliott
24
119
0
30 Nov 2020
Previous
1
2
3
4
5
Next