1904.08607
Cited By
Progressive Attention Memory Network for Movie Story Question Answering
18 April 2019
Junyeong Kim
Minuk Ma
Kyungsu Kim
Sungjin Kim
Chang D. Yoo
Papers citing
"Progressive Attention Memory Network for Movie Story Question Answering"
33 / 33 papers shown
Cross-Modal Reasoning with Event Correlation for Video Question Answering
Chengxiang Yin
Zhengping Che
Kun Wu
Zhiyuan Xu
Qinru Qiu
Jian Tang
166
0
0
20 Dec 2023
Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling
British Machine Vision Conference (BMVC), 2022
Hsin-Ying Lee
Hung-Ting Su
Bing-Chen Tsai
Tsung-Han Wu
Jia-Fong Yeh
Winston H. Hsu
249
2
0
08 Oct 2022
Frame-Subtitle Self-Supervision for Multi-Modal Video Question Answering
Jiong Wang
Zhou Zhao
Weike Jin
116
0
0
08 Sep 2022
Clover: Towards A Unified Video-Language Alignment and Fusion Model
Computer Vision and Pattern Recognition (CVPR), 2022
Jingjia Huang
Yinan Li
Jiashi Feng
Xinglong Wu
Xiaoshuai Sun
Rongrong Ji
VLM
249
55
0
16 Jul 2022
From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Jiangtong Li
Li Niu
Liqing Zhang
167
65
0
30 May 2022
Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Min Peng
Chongyang Wang
Yuan Gao
Yu Shi
Xiang-Dong Zhou
152
29
0
09 May 2022
All in One: Exploring Unified Video-Language Pre-training
Computer Vision and Pattern Recognition (CVPR), 2022
Alex Jinpeng Wang
Yixiao Ge
Rui Yan
Yuying Ge
Xudong Lin
Guanyu Cai
Jianping Wu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
265
235
0
14 Mar 2022
Video Question Answering: Datasets, Algorithms and Challenges
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yaoyao Zhong
Junbin Xiao
Wei Ji
Yicong Li
Wei Deng
Tat-Seng Chua
321
114
0
02 Mar 2022
NEWSKVQA: Knowledge-Aware News Video Question Answering
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2022
Pranay Gupta
Manish Gupta
223
8
0
08 Feb 2022
Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
Min Peng
Chongyang Wang
Yuan Gao
Yu Shi
Xiangdong Zhou
173
4
0
10 Sep 2021
Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering
Computer Vision and Pattern Recognition (CVPR), 2021
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
383
110
0
29 Apr 2021
Temporal Query Networks for Fine-grained Video Understanding
Computer Vision and Pattern Recognition (CVPR), 2021
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
216
95
0
19 Apr 2021
On the hidden treasure of dialog in video question answering
IEEE International Conference on Computer Vision (ICCV), 2021
Deniz Engin
François Schnitzler
Ngoc Q. K. Duong
Yannis Avrithis
202
12
0
26 Mar 2021
Structured Co-reference Graph Attention for Video-grounded Dialogue
AAAI Conference on Artificial Intelligence (AAAI), 2021
Junyeong Kim
Sunjae Yoon
Dahyun Kim
Chang D. Yoo
202
29
0
24 Mar 2021
Multi-Modal Answer Validation for Knowledge-Based VQA
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jialin Wu
Jiasen Lu
Ashish Sabharwal
Roozbeh Mottaghi
328
163
0
23 Mar 2021
Semantic Grouping Network for Video Captioning
AAAI Conference on Artificial Intelligence (AAAI), 2021
Hobin Ryu
Sunghun Kang
Haeyong Kang
Chang D. Yoo
245
150
0
01 Feb 2021
Recent Advances in Video Question Answering: A Review of Datasets and Methods
Devshree Patel
Ratnam Parikh
Yesha Shastri
251
21
0
15 Jan 2021
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object Segmentation
Computer Vision and Pattern Recognition (CVPR), 2020
Hyojin Park
Jayeon Yoo
Seohyeong Jeong
Ganesh Venkatesh
Nojun Kwak
VOS
337
40
0
21 Dec 2020
Trying Bilinear Pooling in Video-QA
T. Winterbottom
S. Xiao
A. McLean
Noura Al Moubayed
183
4
0
18 Dec 2020
SCNet: Training Inference Sample Consistency for Instance Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2020
Thang Vu
Haeyong Kang
Chang D. Yoo
ISeg
289
104
0
18 Dec 2020
iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering
Vasu Sharma
Gurneet Arora
Navpreet Kaloty
184
39
0
16 Nov 2020
TTVOS: Lightweight Video Object Segmentation with Adaptive Template Attention Module and Temporal Consistency Loss
Hyojin Park
Ganesh Venkatesh
Nojun Kwak
VOS
194
5
0
09 Nov 2020
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering
Findings of EMNLP, 2020
Aisha Urooj Khan
Amir Mazaheri
N. Lobo
M. Shah
201
61
0
27 Oct 2020
Hierarchical Conditional Relation Networks for Multimodal Video Question Answering
International Journal of Computer Vision (IJCV), 2020
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
BDL
314
28
0
18 Oct 2020
Self-supervised pre-training and contrastive representation learning for multiple-choice video QA
AAAI Conference on Artificial Intelligence (AAAI), 2020
Seonhoon Kim
Seohyeong Jeong
Eunbyul Kim
Inho Kang
Nojun Kwak
SSL
280
43
0
17 Sep 2020
Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions
European Conference on Computer Vision (ECCV), 2020
Noa Garcia
Yuta Nakashima
236
35
0
17 Jul 2020
PA-GAN: Progressive Attention Generative Adversarial Network for Facial Attribute Editing
Zhenliang He
Meina Kan
Jichao Zhang
Shiguang Shan
CVBM
GAN
109
29
0
12 Jul 2020
Modality Shifting Attention Network for Multi-modal Video Question Answering
Junyeong Kim
Minuk Ma
T. Pham
Kyungsu Kim
Chang D. Yoo
188
75
0
04 Jul 2020
Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA
Hyounghun Kim
Zineng Tang
Joey Tianyi Zhou
121
31
0
13 May 2020
Hierarchical Conditional Relation Networks for Video Question Answering
Computer Vision and Pattern Recognition (CVPR), 2020
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
346
282
0
25 Feb 2020
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
Hung Le
Nancy F. Chen
128
10
0
25 Feb 2020
Neural Reasoning, Fast and Slow, for Video Question Answering
IEEE International Joint Conference on Neural Networks (IJCNN), 2019
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
179
14
0
10 Jul 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
202
252
0
25 Apr 2019