Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.12647
Cited By
Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering
26 July 2022
Yang Liu
Guanbin Li
Liang Lin
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering"
12 / 12 papers shown
Title
Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification
Xiaoshuo Yan
Z. Li
Lei Meng
Zhuang Qi
Wei Wu
Zixuan Li
X. Meng
CML
BDL
24
0
0
13 May 2025
Structure Causal Models and LLMs Integration in Medical Visual Question Answering
Zibo Xu
Qiang Li
Weizhi Nie
Weijie Wang
Anan Liu
CML
MedIm
35
0
0
05 May 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Kaixuan Jiang
Y. Liu
Weixing Chen
Jingzhou Luo
Ziliang Chen
Ling Pan
G. Li
Liang Lin
51
2
0
14 Mar 2025
An Enhanced Large Language Model For Cross Modal Query Understanding System Using DL-KeyBERT Based CAZSSCL-MPGPT
Shreya Singh
36
0
0
24 Feb 2025
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Xinshuai Song
Weixing Chen
Y. Liu
Weikai Chen
Guanbin Li
Liang Lin
117
3
0
12 Dec 2024
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports
Haopeng Li
Andong Deng
Qiuhong Ke
Jun Liu
Hossein Rahmani
Yulan Guo
Mohammed Bennamoun
Chen Chen
37
17
0
03 Jan 2024
Urban Regional Function Guided Traffic Flow Prediction
Kuo Wang
Lingbo Liu
Yang Liu
Guanbin Li
Fan Zhou
Liang Lin
25
25
0
17 Mar 2023
Cross-Modal Causal Intervention for Medical Report Generation
Weixing Chen
Yang Liu
Ce Wang
Jiarui Zhu
Shen Zhao
Guanbin Li
Cheng-Lin Liu
Liang Lin
19
5
0
16 Mar 2023
Self-supervised Contrastive Learning for Audio-Visual Action Recognition
Yang Liu
Y. Tan
Haoyu Lan
SSL
34
5
0
28 Apr 2022
Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering
Jungin Park
Jiyoung Lee
K. Sohn
120
99
0
29 Apr 2021
Counterfactual Samples Synthesizing for Robust Visual Question Answering
Long Chen
Xin Yan
Jun Xiao
Hanwang Zhang
Shiliang Pu
Yueting Zhuang
OOD
AAML
132
287
0
14 Mar 2020
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
261
10,106
0
16 Nov 2016
1