1902.05660
Cycle-Consistency for Robust Visual Question Answering
15 February 2019
Meet Shah
Xinlei Chen
Marcus Rohrbach
Devi Parikh
OOD
Papers citing
"Cycle-Consistency for Robust Visual Question Answering"
50 / 129 papers shown
ULN: Towards Underspecified Vision-and-Language Navigation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Weixi Feng
Tsu-Jui Fu
Yujie Lu
William Yang Wang
290
5
0
18 Oct 2022
Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Xingdi Yuan
Tong Wang
Yen-Hsiang Wang
Emery Fine
Rania Abdelghani
Pauline Lucas
Hélène Sauzéon
Pierre-Yves Oudeyer
260
34
0
22 Sep 2022
A Survey of Deep Causal Models and Their Industrial Applications
Artificial Intelligence Review (Artif Intell Rev), 2022
Zongyu Li
Xiaoning Guo
Siwei Qiang
CML
AI4CE
553
16
0
19 Sep 2022
Correlation Information Bottleneck: Towards Adapting Pretrained Multimodal Models for Robust Visual Question Answering
International Journal of Computer Vision (IJCV), 2022
Jingjing Jiang
Zi-yi Liu
Nanning Zheng
364
13
0
14 Sep 2022
Hierarchical Local-Global Transformer for Temporal Sentence Grounding
IEEE Transactions on Multimedia (IEEE TMM), 2022
Xiang Fang
Daizong Liu
Pan Zhou
Zichuan Xu
Rui Li
234
49
0
31 Aug 2022
A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval
ACM Multimedia (ACM MM), 2022
Alex Falcon
G. Serra
Oswald Lanz
VGen
203
29
0
03 Aug 2022
TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation
British Machine Vision Conference (BMVC), 2022
Jun Wang
M. Gao
Yuqian Hu
Ramprasaath R. Selvaraju
Chetan Ramaiah
Ran Xu
Joseph Jaja
Larry S. Davis
ViT
217
22
0
03 Aug 2022
Consistency-preserving Visual Question Answering in Medical Imaging
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Sergio Tascon-Morales
Pablo Márquez-Neila
Raphael Sznitman
MedIm
180
17
0
27 Jun 2022
Towards Adversarial Attack on Vision-Language Pre-training Models
ACM Multimedia (ACM MM), 2022
Jiaming Zhang
Qiaomin Yi
Jitao Sang
VLM
AAML
297
148
0
19 Jun 2022
Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation
Knowledge Discovery and Data Mining (KDD), 2022
Haohan Wang
Zeyi Huang
Xindi Wu
Eric P. Xing
OOD
126
16
0
04 Jun 2022
Learning to Answer Visual Questions from Web Videos
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
ViT
314
39
0
10 May 2022
All You May Need for VQA are Image Captions
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Soravit Changpinyo
Doron Kukliansky
Idan Szpektor
Xi Chen
Nan Ding
Radu Soricut
259
83
0
04 May 2022
Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
European Conference on Computer Vision (ECCV), 2022
Spencer Whitehead
Suzanne Petryk
Vedaad Shakib
Joseph E. Gonzalez
Trevor Darrell
Anna Rohrbach
Marcus Rohrbach
349
74
0
28 Apr 2022
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks
Zhecan Wang
Noel Codella
Yen-Chun Chen
Luowei Zhou
Xiyang Dai
...
Jianwei Yang
Haoxuan You
Kai-Wei Chang
Shih-Fu Chang
Lu Yuan
VLM
OffRL
260
27
0
22 Apr 2022
Measuring Compositional Consistency for Video Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Mona Gandhi
Mustafa Omer Gul
Eva Prakash
Madeleine Grunde-McLaughlin
Ranjay Krishna
Maneesh Agrawala
CoGe
208
17
0
14 Apr 2022
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Hongru Wang
Wei Liang
Jianbing Shen
Luc Van Gool
Wenguan Wang
209
73
0
30 Mar 2022
CARETS: A Consistency And Robustness Evaluative Test Suite for VQA
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Carlos E. Jimenez
Olga Russakovsky
Karthik Narasimhan
CoGe
156
14
0
15 Mar 2022
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Zhecan Wang
Noel Codella
Yen-Chun Chen
Luowei Zhou
Jianwei Yang
Xiyang Dai
Bin Xiao
Haoxuan You
Shih-Fu Chang
Lu Yuan
CLIP
VLM
209
44
0
15 Jan 2022
MLP Architectures for Vision-and-Language Modeling: An Empirical Study
Yi-Liang Nie
Linjie Li
Zhe Gan
Shuohang Wang
Chenguang Zhu
Michael Zeng
Zicheng Liu
Joey Tianyi Zhou
Lijuan Wang
149
8
0
08 Dec 2021
Robustness through Data Augmentation Loss Consistency
Tianjian Huang
Shaunak Halbe
Chinnadhurai Sankar
P. Amini
Satwik Kottur
A. Geramifard
Meisam Razaviyayn
Ahmad Beirami
OOD
383
10
0
21 Oct 2021
Breaking the Dilemma of Medical Image-to-image Translation
Lingke Kong
Chenyu Lian
Detian Huang
Zhenjiang Li
Yanle Hu
Qichao Zhou
GAN
MedIm
369
182
0
13 Oct 2021
Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering
Long Chen
Yuhang Zheng
Yulei Niu
Hanwang Zhang
Jun Xiao
AAML
OOD
275
47
0
03 Oct 2021
Multimodal Integration of Human-Like Attention in Visual Question Answering
Ekta Sood
Fabian Kögel
Philippe Muller
Dominike Thomas
Mihai Bâce
Andreas Bulling
166
22
0
27 Sep 2021
VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering
Conference on Computational Natural Language Learning (CoNLL), 2021
Ekta Sood
Fabian Kögel
Florian Strohm
Prajit Dhar
Andreas Bulling
169
21
0
27 Sep 2021
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
Jihyung Kil
Cheng Zhang
D. Xuan
Wei-Lun Chao
264
23
0
13 Sep 2021
Pulling Up by the Causal Bootstraps: Causal Data Augmentation for Pre-training Debiasing
International Conference on Information and Knowledge Management (CIKM), 2021
Sindhu C. M. Gowda
Shalmali Joshi
Haoran Zhang
Marzyeh Ghassemi
CML
166
8
0
27 Aug 2021
BiaSwap: Removing dataset bias with bias-tailored swapping augmentation
IEEE International Conference on Computer Vision (ICCV), 2021
Eungyeup Kim
Jihyeon Janel Lee
Jaegul Choo
222
97
0
23 Aug 2021
Separating Skills and Concepts for Novel Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2021
Spencer Whitehead
Hui Wu
Heng Ji
Rogerio Feris
Kate Saenko
CoGe
179
38
0
19 Jul 2021
The Spotlight: A General Method for Discovering Systematic Errors in Deep Learning Models
G. d'Eon
Jason d'Eon
J. R. Wright
Kevin Leyton-Brown
174
84
0
01 Jul 2021
Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Daniel Rosenberg
Itai Gat
Amir Feder
Roi Reichart
AAML
263
16
0
08 Jun 2021
Human-Adversarial Visual Question Answering
Neural Information Processing Systems (NeurIPS), 2021
Sasha Sheng
Amanpreet Singh
Vedanuj Goswami
Jose Alberto Lopez Magana
Wojciech Galuba
Devi Parikh
Douwe Kiela
OOD
EgoV
AAML
118
69
0
04 Jun 2021
Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models
IEEE International Conference on Computer Vision (ICCV), 2021
Linjie Li
Jie Lei
Zhe Gan
Jingjing Liu
AAML
VLM
288
91
0
01 Jun 2021
LPF: A Language-Prior Feedback Objective Function for De-biased Visual Question Answering
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021
Zujie Liang
Haifeng Hu
Jiaying Zhu
203
44
0
29 May 2021
Contrastive Fine-tuning Improves Robustness for Neural Rankers
Findings of the Association for Computational Linguistics (Findings), 2021
Xiaofei Ma
Cicero Nogueira dos Santos
Andrew O. Arnold
265
23
0
27 May 2021
News Headline Grouping as a Challenging NLU Task
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Philippe Laban
Lucas Bandarkar
Marti A. Hearst
147
15
0
12 May 2021
Cross-Modal Generative Augmentation for Visual Question Answering
British Machine Vision Conference (BMVC), 2021
Zixu Wang
Yishu Miao
Lucia Specia
208
11
0
11 May 2021
gComm: An environment for investigating generalization in Grounded Language Acquisition
Rishi Hazra
Sonu Dixit
176
1
0
09 May 2021
Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yanai Elazar
Hongming Zhang
Yoav Goldberg
Dan Roth
ReLM
LRM
307
45
0
16 Apr 2021
Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
IEEE International Conference on Computer Vision (ICCV), 2021
Corentin Dancette
Rémi Cadène
Damien Teney
Matthieu Cord
CML
311
91
0
07 Apr 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Computer Vision and Pattern Recognition (CVPR), 2021
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
278
25
0
29 Mar 2021
A Comprehensive Review of the Video-to-Text Problem
Artificial Intelligence Review (AIR), 2021
Jesus Perez-Martin
B. Bustos
S. Guimarães
I. Sipiran
Jorge A. Pérez
Grethel Coello Said
261
18
0
27 Mar 2021
VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs
Computer Vision and Pattern Recognition (CVPR), 2021
Xudong Lin
Gedas Bertasius
Jue Wang
Shih-Fu Chang
Devi Parikh
Lorenzo Torresani
VGen
232
74
0
28 Jan 2021
Intrinsically Motivated Compositional Language Emergence
Rishi Hazra
Sonu Dixit
Sayambhu Sen
283
1
0
09 Dec 2020
Learning from Lexical Perturbations for Consistent Visual Question Answering
Spencer Whitehead
Hui Wu
Yi R. Fung
Heng Ji
Rogerio Feris
Kate Saenko
145
11
0
26 Nov 2020
Squared $\ell_2$ Norm as Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations
Haohan Wang
Zeyi Huang
Xindi Wu
Eric Xing
145
2
0
25 Nov 2020
Logically Consistent Loss for Visual Question Answering
Anh-Cat Le-Ngo
T. Tran
Santu Rana
Sunil R. Gupta
Svetha Venkatesh
OOD
187
0
0
19 Nov 2020
An Improved Attention for Visual Question Answering
Tanzila Rahman
Shih-Han Chou
Leonid Sigal
Giuseppe Carenini
143
55
0
04 Nov 2020
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
Neural Information Processing Systems (NeurIPS), 2020
Simon Ging
Mohammadreza Zolfaghari
Hamed Pirsiavash
Thomas Brox
ViT
CLIP
200
178
0
01 Nov 2020
SOrT-ing VQA Models: Contrastive Gradient Learning for Improved Consistency
North American Chapter of the Association for Computational Linguistics (NAACL), 2020
Sameer Dharur
Purva Tendulkar
Dhruv Batra
Devi Parikh
Ramprasaath R. Selvaraju
147
2
0
20 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Neurocomputing (Neurocomputing), 2020
Wei Chen
Weiping Wang
Tianpeng Liu
M. Lew
VLM
329
36
0
16 Oct 2020