2305.04224
Cited By
Visual Causal Scene Refinement for Video Question Answering
7 May 2023
Yushen Wei
Yang Liu
Hongfei Yan
Guanbin Li
Liang Lin
CML
Papers citing
"Visual Causal Scene Refinement for Video Question Answering"
7 / 7 papers shown
Title
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Xinshuai Song
Weixing Chen
Y. Liu
Weikai Chen
Guanbin Li
Liang Lin
117
3
0
12 Dec 2024
Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering
Haibo Wang
Chenghang Lai
Yixuan Sun
Weifeng Ge
13
5
0
19 Jan 2024
Cross-Modal Causal Intervention for Medical Report Generation
Weixing Chen
Yang Liu
Ce Wang
Jiarui Zhu
Shen Zhao
Guanbin Li
Cheng-Lin Liu
Liang Lin
19
5
0
16 Mar 2023
Video Graph Transformer for Video Question Answering
Junbin Xiao
Pan Zhou
Tat-Seng Chua
Shuicheng Yan
ViT
134
73
0
12 Jul 2022
Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering
Jungin Park
Jiyoung Lee
K. Sohn
123
99
0
29 Apr 2021
Counterfactual Samples Synthesizing for Robust Visual Question Answering
Long Chen
Xin Yan
Jun Xiao
Hanwang Zhang
Shiliang Pu
Yueting Zhuang
OOD
AAML
132
287
0
14 Mar 2020
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
261
10,106
0
16 Nov 2016