v1v2 (latest)

GuessWhat?! Visual object discovery through multi-modal dialogue

23 November 2016

Olivier Pietquin

Aaron Courville

Papers citing "GuessWhat?! Visual object discovery through multi-modal dialogue"

50 / 237 papers shown

Title
LEATHER: A Framework for Learning to Generate Human-like Text in Dialogue Anthony Sicilia Malihe Alikhani 237 6 0 14 Oct 2022
Understanding Embodied Reference with Touch-Line TransformerInternational Conference on Learning Representations (ICLR), 2022 Yongqian Li Xiaoxue Chen Hao Zhao Jiangtao Gong Guyue Zhou Federico Rossano Yixin Zhu 254 20 0 11 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of DataIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 Ye Zhu Yuehua Wu Andrii Zadaianchuk Yan Yan 342 37 0 05 Oct 2022
Learning More May Not Be Better: Knowledge Transferability in Vision and Language TasksJournal of Imaging (JI), 2022 Tianwei Chen Noa Garcia Mayu Otani Chenhui Chu Yuta Nakashima Hajime Nagahara VLM 120 1 0 23 Aug 2022
Modeling Non-Cooperative Dialogue: Theoretical and Empirical InsightsTransactions of the Association for Computational Linguistics (TACL), 2022 Anthony Sicilia Tristan D. Maidment Pat Healy Malihe Alikhani 135 4 0 15 Jul 2022
Scene-Aware Prompt for Multi-modal Dialogue Understanding and GenerationNatural Language Processing and Chinese Computing (NLPCC), 2022 Bin Li Yixuan Weng Ziyu Ma Bin Sun Shutao Li VLM 129 2 0 05 Jul 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review Hao Wang Bin Guo Y. Zeng Yasan Ding Chen Qiu Ying Zhang Li Yao Zhiwen Yu 229 2 0 02 Jul 2022
Multimodal Dialogue State TrackingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022 Hung Le Nancy F. Chen Guosheng Lin 136 10 0 16 Jun 2022
FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear ModulationNeural Information Processing Systems (NeurIPS), 2022 Mehmet Özgür Türkoglu Alexander Becker H. Gündüz Mina Rezaei B. Bischl Rodrigo Caye Daudt Stefano Dáronco Jan Dirk Wegner Konrad Schindler FedML UQCV 510 33 0 31 May 2022
The Dialog Must Go On: Improving Visual Dialog via Generative Self-TrainingComputer Vision and Pattern Recognition (CVPR), 2022 Gi-Cheon Kang Sungdong Kim Jin-Hwa Kim Donghyun Kwak Byoung-Tak Zhang 274 16 0 25 May 2022
Multimodal Conversational AI: A Survey of Datasets and Approaches Anirudh S. Sundar Larry Heck 158 32 0 13 May 2022
Supplementing Missing Visions via Dialog for Scene Graph GenerationsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 Zhenghao Zhao Ye Zhu Xiaoguang Zhu Yuzhang Shang Yan Yan 166 1 0 23 Apr 2022
Learning to Execute Actions or Ask Clarification Questions Zhengxiang Shi Yue Feng Aldo Lipani LM&Ro 224 48 0 18 Apr 2022
Co-VQA : Answering by Interactive Sub Question SequenceFindings (Findings), 2022 Ruonan Wang Yuxi Qian Fangxiang Feng Xiaojie Wang Huixing Jiang LRM 156 19 0 02 Apr 2022
Image Retrieval from Contextual DescriptionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Benno Krojer Vaibhav Adlakha Vibhav Vineet Yash Goyal Edoardo Ponti Siva Reddy 244 38 0 29 Mar 2022
Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene Duo Zheng Fandong Meng Q. Si Hairun Fan Zipeng Xu Jie Zhou Fangxiang Feng Xiaojie Wang 158 0 0 16 Mar 2022
Vision-Language Intelligence: Tasks, Representation Learning, and Large Models Feng Li Hao Zhang Yi-Fan Zhang Shixuan Liu Jian Guo L. Ni Pengchuan Zhang Lei Zhang AI4TS VLM 189 40 0 03 Mar 2022
CAISE: Conversational Agent for Image Search and EditingAAAI Conference on Artificial Intelligence (AAAI), 2022 Hyounghun Kim Doo Soon Kim Seunghyun Yoon Franck Dernoncourt Trung Bui Joey Tianyi Zhou 204 6 0 24 Feb 2022
Multimodal Interactions Using Pretrained Unimodal Models for SIMMC 2.0 Joosung Lee Kijong Han 185 6 0 10 Dec 2021
Channel Exchanging Networks for Multimodal and Multitask Dense Image PredictionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 Yikai Wang Gang Hua Wenbing Huang Fengxiang He Dacheng Tao 257 43 0 04 Dec 2021
Building Goal-Oriented Dialogue Systems with Situated Visual ContextAAAI Conference on Artificial Intelligence (AAAI), 2021 Sanchit Agarwal Jan Jezabek Arijit Biswas Emre Barut Shuyang Gao Tagyoung Chung 145 1 0 22 Nov 2021
Open-domain clarification question generation without question examples Julia White Gabriel Poesia Robert D. Hawkins Dorsa Sadigh Noah D. Goodman 95 28 0 19 Oct 2021
A Framework for Learning to Request Rich and Contextually Useful Information from Humans Khanh Nguyen Yonatan Bisk Hal Daumé 452 20 0 14 Oct 2021
OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts Shuhe Wang Yuxian Meng Xiaoya Li Xiaofei Sun Rongbin Ouyang Jiwei Li MLLM VLM 211 23 0 27 Sep 2021
Learning Natural Language Generation from Scratch Alice Martin Donati Guillaume Quispe Charles Ollion Sylvain Le Corff Florian Strub Olivier Pietquin LRM 131 4 0 20 Sep 2021
Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy A. Testoni Raffaella Bernardi 127 11 0 11 Sep 2021
We went to look for meaning and all we got were these lousy representations: aspects of meaning representation for computational semantics Simon Dobnik R. Cooper Adam Ek Bill Noble Staffan Larsson N. Ilinykh Vladislav Maraev Vidya Somashekarappa 130 0 0 10 Sep 2021
YouRefIt: Embodied Reference Understanding with Language and GestureIEEE International Conference on Computer Vision (ICCV), 2021 Yixin Chen Qing Li Deqian Kong Yik Lun Kei Song-Chun Zhu Tao Gao Yixin Zhu Siyuan Huang LM&Ro 223 48 0 08 Sep 2021
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented GuesserConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 Duo Zheng Zipeng Xu Fandong Meng Caixia Yuan Jiaan Wang Jie Zhou 102 13 0 06 Sep 2021
Chest ImaGenome Dataset for Clinical Reasoning Joy T. Wu Nkechinyere N. Agu Ismini Lourentzou Arjun Sharma J. Paguio ... William Mitchell Satyananda Kashyap Andrea Giovannini Leo Anthony Celi Mehdi Moradi 227 89 0 31 Jul 2021
Language Grounding with 3D ObjectsConference on Robot Learning (CoRL), 2021 Jesse Thomason Mohit Shridhar Yonatan Bisk Chris Paxton Luke Zettlemoyer LM&Ro 198 54 0 26 Jul 2021
Adversarial Reinforced Instruction Attacker for Robust Vision-Language NavigationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 Bingqian Lin Yi Zhu Yanxin Long Xiaodan Liang QiXiang Ye Liang Lin AAML 190 20 0 23 Jul 2021
Modeling Explicit Concerning States for Reinforcement Learning in Visual Dialogue Zipeng Xu Fandong Meng Caixia Yuan Duo Zheng Chenxu Lv Jie Zhou OffRL 161 6 0 12 Jul 2021
Target-dependent UNITER: A Transformer-Based Multimodal Language Comprehension Model for Domestic Service Robots Shintaro Ishikawa K. Sugiura 164 13 0 02 Jul 2021
Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual DialogueIEEE International Conference on Computer Vision (ICCV), 2021 Shoya Matsumori Kosuke Shingyouchi Yukikoko Abe Yosuke Fukuchi K. Sugiura M. Imai 168 17 0 29 Jun 2021
Saying the Unseen: Video Descriptions via Dialog AgentsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 Ye Zhu Yu Wu Yi Yang Yan Yan 190 8 0 26 Jun 2021
$C^3$ : Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues Hung Le Nancy F. Chen Guosheng Lin 107 2 0 16 Jun 2021
Grounding 'Grounding' in NLPFindings (Findings), 2021 Khyathi Chandu Yonatan Bisk A. Black 149 57 0 04 Jun 2021
Maintaining Common Ground in Dynamic EnvironmentsTransactions of the Association for Computational Linguistics (TACL), 2021 Takuma Udagawa Akiko Aizawa 160 15 0 29 May 2021
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic RepresentationComputer Vision and Pattern Recognition (CVPR), 2021 Tao Tu Q. Ping Govind Thattai Gokhan Tur Premkumar Natarajan 133 18 0 24 May 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic SurveyArtificial Intelligence Review (AIR), 2021 Jinjie Ni Tom Young Vlad Pandelea Fuzhao Xue Xiaoshi Zhong 779 320 0 10 May 2021
A recipe for annotating grounded clarificationsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021 Luciana Benotti P. Blackburn 82 19 0 18 Apr 2021
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal ConversationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 Satwik Kottur Seungwhan Moon A. Geramifard Babak Damavandi 220 98 0 18 Apr 2021
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue SystemsJournal of Artificial Intelligence Research (JAIR), 2021 E. Razumovskaia Goran Glavaš Olga Majewska Edoardo Ponti Anna Korhonen Ivan Vulić 481 37 0 17 Apr 2021
Ensemble of MRR and NDCG models for Visual DialogNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021 Idan Schwartz 226 10 0 15 Apr 2021
Structured Co-reference Graph Attention for Video-grounded DialogueAAAI Conference on Artificial Intelligence (AAAI), 2021 Junyeong Kim Sunjae Yoon Dahyun Kim Chang D. Yoo 202 29 0 24 Mar 2021
The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual DialoguesConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 A. Testoni Raffaella Bernardi 82 4 0 20 Mar 2021
Overprotective Training Environments Fall Short at Testing Time: Let Models Contribute to Their Own TrainingItalian Conference on Computational Linguistics (CLiC-it), 2021 A. Testoni Raffaella Bernardi 99 2 0 20 Mar 2021
I Want This Product but Different : Multimodal Retrieval with Synthetic Query Expansion Ivona Tautkute Tomasz Trzciñski 219 5 0 17 Feb 2021
Converse, Focus and Guess -- Towards Multi-Document Driven DialogueAAAI Conference on Artificial Intelligence (AAAI), 2021 Han Liu Caixia Yuan Xiaojie Wang Yushu Yang Huixing Jiang Zhongyuan Wang 205 1 0 04 Feb 2021