ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.03493
  4. Cited By
Causal Attention for Vision-Language Tasks

Causal Attention for Vision-Language Tasks

5 March 2021
Xu Yang
Hanwang Zhang
Guojun Qi
Jianfei Cai
    CML
ArXivPDFHTML

Papers citing "Causal Attention for Vision-Language Tasks"

50 / 79 papers shown
Title
Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification
Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification
Xiaoshuo Yan
Z. Li
Lei Meng
Zhuang Qi
Wei Wu
Zixuan Li
X. Meng
CML
BDL
28
0
0
13 May 2025
A Generative Re-ranking Model for List-level Multi-objective Optimization at Taobao
A Generative Re-ranking Model for List-level Multi-objective Optimization at Taobao
Yue Meng
Cheng Guo
Yi Cao
Tong Liu
Bo Zheng
11
0
0
12 May 2025
Structure Causal Models and LLMs Integration in Medical Visual Question Answering
Structure Causal Models and LLMs Integration in Medical Visual Question Answering
Zibo Xu
Qiang Li
Weizhi Nie
Weijie Wang
Anan Liu
CML
MedIm
35
0
0
05 May 2025
Bayesian Cross-Modal Alignment Learning for Few-Shot Out-of-Distribution Generalization
Bayesian Cross-Modal Alignment Learning for Few-Shot Out-of-Distribution Generalization
Lin Zhu
Yifeng Yang
Zichao Nie
Yuan Gao
VLM
28
3
0
13 Apr 2025
Deconfounded Reasoning for Multimodal Fake News Detection via Causal Intervention
Deconfounded Reasoning for Multimodal Fake News Detection via Causal Intervention
Moyang Liu
Kaiying Yan
Yukun Liu
Ruibo Fu
Zhengqi Wen
Xuefei Liu
Chenxing Li
26
0
0
12 Apr 2025
CAUSAL3D: A Comprehensive Benchmark for Causal Learning from Visual Data
Disheng Liu
Yiran Qiao
Wuche Liu
Yiren Lu
Yunlai Zhou
Tuo Liang
Yu Yin
Jing Ma
CML
3DV
54
0
0
06 Mar 2025
Omni-SILA: Towards Omni-scene Driven Visual Sentiment Identifying, Locating and Attributing in Videos
Jiamin Luo
Jingjing Wang
Junxiao Ma
Yujie Jin
Shoushan Li
Guodong Zhou
31
0
0
26 Feb 2025
Who Brings the Frisbee: Probing Hidden Hallucination Factors in Large
  Vision-Language Model via Causality Analysis
Who Brings the Frisbee: Probing Hidden Hallucination Factors in Large Vision-Language Model via Causality Analysis
Po-Hsuan Huang
Jeng-Lin Li
Chin-Po Chen
Ming-Ching Chang
Wei-Chao Chen
LRM
72
1
0
04 Dec 2024
Time-Causal VAE: Robust Financial Time Series Generator
Time-Causal VAE: Robust Financial Time Series Generator
Beatrice Acciaio
Stephan Eckstein
Songyan Hou
AI4TS
23
2
0
05 Nov 2024
Causality for Large Language Models
Causality for Large Language Models
Anpeng Wu
Kun Kuang
Minqin Zhu
Yingrong Wang
Yujia Zheng
Kairong Han
B. Li
Guangyi Chen
Fei Wu
Kun Zhang
LRM
46
6
0
20 Oct 2024
Fine-Tuning Pre-trained Language Models for Robust Causal Representation
  Learning
Fine-Tuning Pre-trained Language Models for Robust Causal Representation Learning
Jialin Yu
Yuxiang Zhou
Yulan He
Nevin L. Zhang
Ricardo Silva
31
0
0
18 Oct 2024
MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning
MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning
Tieyuan Chen
Huabin Liu
Tianyao He
Yihang Chen
Chaofan Gan
...
Cheng Zhong
Yang Zhang
Yingxue Wang
Hui Lin
Weiyao Lin
VGen
CML
35
4
0
26 Sep 2024
Towards Deconfounded Image-Text Matching with Causal Inference
Towards Deconfounded Image-Text Matching with Causal Inference
Wenhui Li
Xinqi Su
Dan Song
Lanjun Wang
Kun Zhang
An-An Liu
BDL
CML
40
10
0
22 Aug 2024
Causal Interventional Prediction System for Robust and Explainable
  Effect Forecasting
Causal Interventional Prediction System for Robust and Explainable Effect Forecasting
Zhixuan Chu
Hui Ding
Guang Zeng
Shiyu Wang
Yiming Li
CML
25
1
0
29 Jul 2024
Causality-inspired Discriminative Feature Learning in Triple Domains for
  Gait Recognition
Causality-inspired Discriminative Feature Learning in Triple Domains for Gait Recognition
Haijun Xiong
Bin Feng
Xinggang Wang
Wenyu Liu
CVBM
29
2
0
17 Jul 2024
Visual Language Model based Cross-modal Semantic Communication Systems
Visual Language Model based Cross-modal Semantic Communication Systems
Feibo Jiang
Chuanguo Tang
Li Dong
Kezhi Wang
Kun Yang
Cunhua Pan
VLM
31
2
0
06 May 2024
Vision-and-Language Navigation via Causal Learning
Vision-and-Language Navigation via Causal Learning
Liuyi Wang
Zongtao He
Ronghao Dang
Mengjiao Shen
Chengju Liu
Qijun Chen
CML
39
13
0
16 Apr 2024
De-confounded Data-free Knowledge Distillation for Handling Distribution
  Shifts
De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts
Yuzheng Wang
Dingkang Yang
Zhaoyu Chen
Yang Liu
Siao Liu
Wenqiang Zhang
Lihua Zhang
Lizhe Qi
32
6
0
28 Mar 2024
CASPER: Causality-Aware Spatiotemporal Graph Neural Networks for
  Spatiotemporal Time Series Imputation
CASPER: Causality-Aware Spatiotemporal Graph Neural Networks for Spatiotemporal Time Series Imputation
Baoyu Jing
Dawei Zhou
Kan Ren
Carl Yang
CML
AI4TS
27
6
0
18 Mar 2024
Causal Prompting: Debiasing Large Language Model Prompting based on
  Front-Door Adjustment
Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment
Congzhi Zhang
Linhai Zhang
Jialong Wu
Deyu Zhou
Guoqiang Xu
CML
AI4CE
LRM
44
15
0
05 Mar 2024
Enhancing Vision-Language Pre-training with Rich Supervisions
Enhancing Vision-Language Pre-training with Rich Supervisions
Yuan Gao
Kunyu Shi
Pengkai Zhu
Edouard Belval
Oren Nuriel
Srikar Appalaraju
Shabnam Ghadar
Vijay Mahadevan
Zhuowen Tu
Stefano Soatto
VLM
CLIP
62
12
0
05 Mar 2024
Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral
  Pedestrian Detection
Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Taeheon Kim
Sebin Shin
Youngjoon Yu
Hak Gu Kim
Y. Ro
25
5
0
02 Mar 2024
How to Understand "Support"? An Implicit-enhanced Causal Inference
  Approach for Weakly-supervised Phrase Grounding
How to Understand "Support"? An Implicit-enhanced Causal Inference Approach for Weakly-supervised Phrase Grounding
Jiamin Luo
Jianing Zhao
Jingjing Wang
Guodong Zhou
36
0
0
29 Feb 2024
Can Large Language Models Learn Independent Causal Mechanisms?
Can Large Language Models Learn Independent Causal Mechanisms?
Gael Gendron
Bao Trung Nguyen
A. Peng
Michael Witbrock
Gillian Dobbie
LRM
10
3
0
04 Feb 2024
Bootstrapping OTS-Funcimg Pre-training Model (Botfip) -- A Comprehensive
  Symbolic Regression Framework
Bootstrapping OTS-Funcimg Pre-training Model (Botfip) -- A Comprehensive Symbolic Regression Framework
Tianhao Chen
Pengbo Xu
Haibiao Zheng
AI4CE
19
4
0
18 Jan 2024
Causality is all you need
Causality is all you need
Ning Xu
Yifei Gao
Hongshuo Tian
Yongdong Zhang
An-An Liu
23
0
0
21 Nov 2023
Improving Vision-and-Language Reasoning via Spatial Relations Modeling
Improving Vision-and-Language Reasoning via Spatial Relations Modeling
Cheng Yang
Rui Xu
Ye Guo
Peixiang Huang
Yiru Chen
Wenkui Ding
Zhongyuan Wang
Hong Zhou
LRM
8
5
0
09 Nov 2023
Accurate Use of Label Dependency in Multi-Label Text Classification
  Through the Lens of Causality
Accurate Use of Label Dependency in Multi-Label Text Classification Through the Lens of Causality
Caoyun Fan
Wenqing Chen
Jidong Tian
Yitian Li
Hao He
Yaohui Jin
28
6
0
11 Oct 2023
Causal Unsupervised Semantic Segmentation
Causal Unsupervised Semantic Segmentation
Junho Kim
Byung-Kwan Lee
Yonghyun Ro
23
18
0
11 Oct 2023
Beyond Generation: Harnessing Text to Image Models for Object Detection
  and Segmentation
Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation
Yunhao Ge
Jiashu Xu
Brian Nlong Zhao
Neel Joshi
Laurent Itti
Vibhav Vineet
DiffM
30
14
0
12 Sep 2023
Measuring the Effect of Causal Disentanglement on the Adversarial
  Robustness of Neural Network Models
Measuring the Effect of Causal Disentanglement on the Adversarial Robustness of Neural Network Models
Preben Ness
D. Marijan
Sunanda Bose
CML
11
0
0
21 Aug 2023
Unveiling Cross Modality Bias in Visual Question Answering: A Causal
  View with Possible Worlds VQA
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA
A. Vosoughi
Shijian Deng
Songyang Zhang
Yapeng Tian
Chenliang Xu
Jiebo Luo
CML
21
3
0
31 May 2023
Semantic Composition in Visually Grounded Language Models
Semantic Composition in Visually Grounded Language Models
Rohan Pandey
CoGe
16
1
0
15 May 2023
Simple Token-Level Confidence Improves Caption Correctness
Simple Token-Level Confidence Improves Caption Correctness
Suzanne Petryk
Spencer Whitehead
Joseph E. Gonzalez
Trevor Darrell
Anna Rohrbach
Marcus Rohrbach
18
7
0
11 May 2023
Clothes-Invariant Feature Learning by Causal Intervention for
  Clothes-Changing Person Re-identification
Clothes-Invariant Feature Learning by Causal Intervention for Clothes-Changing Person Re-identification
Xulin Li
Yan Lu
B. Liu
Yuenan Hou
Yating Liu
Qi Chu
Wanli Ouyang
Nenghai Yu
OOD
CML
14
4
0
10 May 2023
Visual Causal Scene Refinement for Video Question Answering
Visual Causal Scene Refinement for Video Question Answering
Yushen Wei
Yang Liu
Hongfei Yan
Guanbin Li
Liang Lin
CML
12
20
0
07 May 2023
Transforming Visual Scene Graphs to Image Captions
Transforming Visual Scene Graphs to Image Captions
Xu Yang
Jiawei Peng
Zihua Wang
Haiyang Xu
Qinghao Ye
Chenliang Li
Mingshi Yan
Feisi Huang
Zhangzikang Li
Yu Zhang
37
18
0
03 May 2023
VCD: Visual Causality Discovery for Cross-Modal Question Reasoning
VCD: Visual Causality Discovery for Cross-Modal Question Reasoning
Y. Liu
Guanbin Li
Jingzhou Luo
Liang Lin
BDL
LRM
38
5
0
17 Apr 2023
ImageCaptioner$^2$: Image Captioner for Image Captioning Bias
  Amplification Assessment
ImageCaptioner2^22: Image Captioner for Image Captioning Bias Amplification Assessment
Eslam Mohamed Bakr
Pengzhan Sun
Erran L. Li
Mohamed Elhoseiny
15
6
0
10 Apr 2023
Transformers in Speech Processing: A Survey
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
33
46
0
21 Mar 2023
Cross-Modal Causal Intervention for Medical Report Generation
Cross-Modal Causal Intervention for Medical Report Generation
Weixing Chen
Yang Liu
Ce Wang
Jiarui Zhu
Shen Zhao
Guanbin Li
Cheng-Lin Liu
Liang Lin
21
5
0
16 Mar 2023
CMVAE: Causal Meta VAE for Unsupervised Meta-Learning
CMVAE: Causal Meta VAE for Unsupervised Meta-Learning
Guodong Qi
Huimin Yu
CML
SSL
14
4
0
20 Feb 2023
A Survey of Methods, Challenges and Perspectives in Causality
A Survey of Methods, Challenges and Perspectives in Causality
Gael Gendron
Michael Witbrock
Gillian Dobbie
OOD
AI4CE
CML
12
12
0
01 Feb 2023
Debiased Fine-Tuning for Vision-language Models by Prompt Regularization
Debiased Fine-Tuning for Vision-language Models by Prompt Regularization
Beier Zhu
Yulei Niu
Saeil Lee
Minhoe Hur
Hanwang Zhang
VLM
VPVLM
19
22
0
29 Jan 2023
Adaptively Clustering Neighbor Elements for Image-Text Generation
Adaptively Clustering Neighbor Elements for Image-Text Generation
Zihua Wang
Xu Yang
Hanwang Zhang
Haiyang Xu
Mingshi Yan
Feisi Huang
Yu Zhang
VLM
69
0
0
05 Jan 2023
Cross-modal Attention Congruence Regularization for Vision-Language
  Relation Alignment
Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment
Rohan Pandey
Rulin Shao
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
16
12
0
20 Dec 2022
VindLU: A Recipe for Effective Video-and-Language Pretraining
VindLU: A Recipe for Effective Video-and-Language Pretraining
Feng Cheng
Xizi Wang
Jie Lei
David J. Crandall
Mohit Bansal
Gedas Bertasius
VLM
27
78
0
09 Dec 2022
Causal Inference via Style Transfer for Out-of-distribution
  Generalisation
Causal Inference via Style Transfer for Out-of-distribution Generalisation
Toan Nguyen
Kien Do
D. Nguyen
Bao Duong
T. Nguyen
CML
OODD
OOD
22
10
0
06 Dec 2022
Debiasing Methods for Fairer Neural Models in Vision and Language
  Research: A Survey
Debiasing Methods for Fairer Neural Models in Vision and Language Research: A Survey
Otávio Parraga
Martin D. Móre
C. M. Oliveira
Nathan Gavenski
L. S. Kupssinskü
Adilson Medronha
L. V. Moura
Gabriel S. Simões
Rodrigo C. Barros
31
11
0
10 Nov 2022
Knowledge is Power: Understanding Causality Makes Legal judgment
  Prediction Models More Generalizable and Robust
Knowledge is Power: Understanding Causality Makes Legal judgment Prediction Models More Generalizable and Robust
Haotian Chen
Lingwei Zhang
Yiran Liu
Fanchao Chen
Yang Yu
AILaw
ELM
17
4
0
06 Nov 2022
12
Next