Reasoning Visual Dialogs with Structural and Partial Observations

11 April 2019
Zilong Zheng
Wenguan Wang
Siyuan Qi
Song-Chun Zhu

Papers citing "Reasoning Visual Dialogs with Structural and Partial Observations"

50 / 55 papers shown
Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey
Jiayi Kuang
Jingyou Xie
Haohao Luo
Ronghao Li
Zhe Xu
Xianfeng Cheng
Yinghui Li
Xika Lin
Ying Shen
LRM
85
8
0
26 Nov 2024
ChartKG: A Knowledge-Graph-Based Representation for Chart Images
Zhiguang Zhou
Haoxuan Wang
Zhengqing Zhao
Fengling Zheng
Yongheng Wang
Wei Chen
Yong Wang
17
0
0
13 Oct 2024
Navigation Instruction Generation with BEV Perception and Large Language Models
Sheng Fan
Rui Liu
Wenguan Wang
Yi Yang
34
5
0
21 Jul 2024
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models
Jierun Chen
Fangyun Wei
Jinjing Zhao
Sizhe Song
Bohuai Wu
Zhuoxuan Peng
S.-H. Gary Chan
Hongyang R. Zhang
30
8
0
24 Jun 2024
InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models
Bingbing Wen
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Bill Howe
Lijuan Wang
MLLM
23
1
0
21 Dec 2023
$\mathbb{VD}$-$\mathbb{GR}$: Boosting $\mathbb{V}$isual $\mathbb{D}$ialog with Cascaded Spatial-Temporal Multi-Modal $\mathbb{GR}$aphs
Adnen Abdessaied
Lei Shi
Andreas Bulling
3DH
11
3
0
25 Oct 2023
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions
Yuxuan Wang
Zilong Zheng
Xueliang Zhao
Jinpeng Li
Yueqian Wang
Dongyan Zhao
VGen
9
9
0
30 May 2023
IRRGN: An Implicit Relational Reasoning Graph Network for Multi-turn Response Selection
Jingcheng Deng
Hengwei Dai
Xuewei Guo
Yuanchen Ju
Wei Peng
LRM
4
2
0
01 Dec 2022
Unified Multimodal Model with Unlikelihood Training for Visual Dialog
Zihao Wang
Junli Wang
Changjun Jiang
MLLM
14
10
0
23 Nov 2022
Neuro-Symbolic Visual Dialog
Adnen Abdessaied
Mihai Bâce
Andreas Bulling
NAI
14
1
0
22 Aug 2022
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning
Zhipeng Zhang
Zhimin Wei
Zhongzhen Huang
Rui Niu
Peng Wang
ObjD
LRM
9
9
0
31 Jul 2022
Adversarial Robustness of Visual Dialog
Lu Yu
Verena Rieser
AAML
13
0
0
06 Jul 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review
Hao Wang
Bin Guo
Y. Zeng
Yasan Ding
Chen Qiu
Ying Zhang
Li Yao
Zhiwen Yu
17
2
0
02 Jul 2022
The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training
Gi-Cheon Kang
Sungdong Kim
Jin-Hwa Kim
Donghyun Kwak
Byoung-Tak Zhang
8
10
0
25 May 2022
UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog
Cheng Chen
Yudong Zhu
Zhenshan Tan
Qingrong Cheng
Xin Jiang
Qun Liu
X. Gu
20
39
0
01 May 2022
Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning
Feilong Chen
Xiuyi Chen
Shuang Xu
Bo Xu
VLM
18
18
0
15 Apr 2022
Reasoning with Multi-Structure Commonsense Knowledge in Visual Dialog
Shunyu Zhang
X. Jiang
Zequn Yang
T. Wan
Zengchang Qin
17
12
0
10 Apr 2022
Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships
Chao Lou
Wenjuan Han
Yuh-Chen Lin
Zilong Zheng
CoGe
14
10
0
27 Mar 2022
Rich Action-semantic Consistent Knowledge for Early Action Prediction
Xiaoli Liu
Jianqin Yin
Dianming Guo
Huaping Liu
29
2
0
23 Jan 2022
OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts
Shuhe Wang
Yuxian Meng
Xiaoya Li
Xiaofei Sun
Rongbin Ouyang
Jiwei Li
MLLM
VLM
8
17
0
27 Sep 2021
GoG: Relation-aware Graph-over-Graph Network for Visual Dialog
Feilong Chen
Xiuyi Chen
Fandong Meng
Peng Li
Jie Zhou
65
34
0
17 Sep 2021
Learning to Ground Visual Objects for Visual Dialog
Feilong Chen
Xiuyi Chen
Can Xu
Daxin Jiang
OOD
15
17
0
13 Sep 2021
Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition
Ziwei Xu
Guangzhi Wang
Yongkang Wong
Mohan S. Kankanhalli
30
26
0
10 Aug 2021
Learning Multi-Attention Context Graph for Group-Based Re-Identification
Yichao Yan
Jie Qin
Bingbing Ni
Jiaxin Chen
Li Liu
Fan Zhu
Weishi Zheng
Xiaokang Yang
Ling Shao
18
32
0
29 Apr 2021
Ensemble of MRR and NDCG models for Visual Dialog
Idan Schwartz
14
3
0
15 Apr 2021
Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues
Hung Le
Nancy F. Chen
S. Hoi
18
14
0
01 Mar 2021
OpenViDial: A Large-Scale, Open-Domain Dialogue Dataset with Visual Contexts
Yuxian Meng
Shuhe Wang
Qinghong Han
Xiaofei Sun
Fei Wu
Rui Yan
Jiwei Li
8
25
0
30 Dec 2020
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
Zhaokai Wang
Renda Bao
Qi Wu
Si Liu
8
26
0
07 Dec 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions
Radhika Dua
Sai Srinivas Kancheti
V. Balasubramanian
LRM
22
19
0
24 Oct 2020
A Linguistic Analysis of Visually Grounded Dialogues Based on Spatial Expressions
Takuma Udagawa
T. Yamazaki
Akiko Aizawa
12
10
0
07 Oct 2020
Referring Image Segmentation via Cross-Modal Progressive Comprehension
Shaofei Huang
Tianrui Hui
Si Liu
Guanbin Li
Yunchao Wei
Jizhong Han
Luoqi Liu
Bo-wen Li
EgoV
8
173
0
01 Oct 2020
KBGN: Knowledge-Bridge Graph Network for Adaptive Vision-Text Reasoning in Visual Dialogue
X. Jiang
Siyi Du
Zengchang Qin
Yajing Sun
J. Yu
8
32
0
11 Aug 2020
SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space
Liu Yang
VLM
11
5
0
02 Aug 2020
Referring Expression Comprehension: A Survey of Methods and Datasets
Yanyuan Qiao
Chaorui Deng
Qi Wu
ObjD
29
74
0
19 Jul 2020
Active Visual Information Gathering for Vision-Language Navigation
Hanqing Wang
Wenguan Wang
Tianmin Shu
Wei Liang
Jianbing Shen
14
65
0
15 Jul 2020
DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue
X. Jiang
J. Yu
Yajing Sun
Zengchang Qin
Zihao Zhu
Yue Hu
Qi Wu
MLLM
16
19
0
07 Jul 2020
Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation
Guolei Sun
Wenguan Wang
Jifeng Dai
Luc Van Gool
6
280
0
03 Jul 2020
ORD: Object Relationship Discovery for Visual Dialogue Generation
Ziwei Wang
Zi Huang
Yadan Luo
Huimin Lu
6
4
0
15 Jun 2020
VD-BERT: A Unified Vision and Dialog Transformer with BERT
Yue Wang
Shafiq R. Joty
Michael R. Lyu
Irwin King
Caiming Xiong
S. Hoi
6
94
0
28 Apr 2020
Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense
Yixin Zhu
Tao Gao
Lifeng Fan
Siyuan Huang
Mark Edmonds
...
Chi Zhang
Siyuan Qi
Ying Nian Wu
J. Tenenbaum
Song-Chun Zhu
9
113
0
20 Apr 2020
Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer
Gi-Cheon Kang
Junseok Park
Hwaran Lee
Byoung-Tak Zhang
Jin-Hwa Kim
VLM
6
8
0
14 Apr 2020
Iterative Context-Aware Graph Inference for Visual Dialog
Dan Guo
Haibo Wang
Hanwang Zhang
Zhengjun Zha
Meng Wang
6
49
0
05 Apr 2020
Vision-Dialog Navigation by Exploring Cross-modal Memory
Yi Zhu
Fengda Zhu
Zhaohuan Zhan
Bingqian Lin
Jianbin Jiao
Xiaojun Chang
Xiaodan Liang
VLM
14
49
0
15 Mar 2020
Hierarchical Human Parsing with Typed Part-Relation Reasoning
Wenguan Wang
Hailong Zhu
Jifeng Dai
Yanwei Pang
Jianbing Shen
Ling Shao
10
102
0
10 Mar 2020
Guessing State Tracking for Visual Dialogue
Wei Pang
Xiaojie Wang
OOD
13
10
0
24 Feb 2020
Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks
Wenguan Wang
Xiankai Lu
Jianbing Shen
David J. Crandall
Ling Shao
VOS
8
269
0
19 Jan 2020
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Vishvak Murahari
Dhruv Batra
Devi Parikh
Abhishek Das
VLM
11
115
0
05 Dec 2019
Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs
Van-Quang Nguyen
Masanori Suganuma
Takayuki Okatani
8
7
0
26 Nov 2019
Two Causal Principles for Improving Visual Dialog
Jiaxin Qi
Yulei Niu
Jianqiang Huang
Hanwang Zhang
CML
6
145
0
24 Nov 2019
DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue
X. Jiang
J. Yu
Zengchang Qin
Yingying Zhuang
Xingxing Zhang
Yue Hu
Qi Wu
15
61
0
17 Nov 2019