ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.08607
  4. Cited By
Progressive Attention Memory Network for Movie Story Question Answering

Progressive Attention Memory Network for Movie Story Question Answering

18 April 2019
Junyeong Kim
Minuk Ma
Kyungsu Kim
Sungjin Kim
Chang D. Yoo
ArXiv (abs)PDFHTML

Papers citing "Progressive Attention Memory Network for Movie Story Question Answering"

33 / 33 papers shown
Cross-Modal Reasoning with Event Correlation for Video Question
  Answering
Cross-Modal Reasoning with Event Correlation for Video Question Answering
Chengxiang Yin
Zhengping Che
Kun Wu
Zhiyuan Xu
Qinru Qiu
Jian Tang
181
0
0
20 Dec 2023
Learning Fine-Grained Visual Understanding for Video Question Answering
  via Decoupling Spatial-Temporal Modeling
Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal ModelingBritish Machine Vision Conference (BMVC), 2022
Hsin-Ying Lee
Hung-Ting Su
Bing-Chen Tsai
Tsung-Han Wu
Jia-Fong Yeh
Winston H. Hsu
305
2
0
08 Oct 2022
Frame-Subtitle Self-Supervision for Multi-Modal Video Question Answering
Frame-Subtitle Self-Supervision for Multi-Modal Video Question Answering
Jiong Wang
Zhou Zhao
Weike Jin
128
0
0
08 Sep 2022
Clover: Towards A Unified Video-Language Alignment and Fusion Model
Clover: Towards A Unified Video-Language Alignment and Fusion ModelComputer Vision and Pattern Recognition (CVPR), 2022
Jingjia Huang
Yinan Li
Jiashi Feng
Xinglong Wu
Xiaoshuai Sun
Rongrong Ji
VLM
278
55
0
16 Jul 2022
From Representation to Reasoning: Towards both Evidence and Commonsense
  Reasoning for Video Question-Answering
From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-AnsweringComputer Vision and Pattern Recognition (CVPR), 2022
Jiangtong Li
Li Niu
Liqing Zhang
183
66
0
30 May 2022
Multilevel Hierarchical Network with Multiscale Sampling for Video
  Question Answering
Multilevel Hierarchical Network with Multiscale Sampling for Video Question AnsweringInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Min Peng
Chongyang Wang
Yuan Gao
Yu Shi
Xiang-Dong Zhou
160
29
0
09 May 2022
All in One: Exploring Unified Video-Language Pre-training
All in One: Exploring Unified Video-Language Pre-trainingComputer Vision and Pattern Recognition (CVPR), 2022
Alex Jinpeng Wang
Yixiao Ge
Rui Yan
Yuying Ge
Xudong Lin
Guanyu Cai
Jianping Wu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
293
236
0
14 Mar 2022
Video Question Answering: Datasets, Algorithms and Challenges
Video Question Answering: Datasets, Algorithms and ChallengesConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yaoyao Zhong
Junbin Xiao
Wei Ji
Yicong Li
Wei Deng
Tat-Seng Chua
329
114
0
02 Mar 2022
NEWSKVQA: Knowledge-Aware News Video Question Answering
NEWSKVQA: Knowledge-Aware News Video Question AnsweringPacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2022
Pranay Gupta
Manish Gupta
235
9
0
08 Feb 2022
Temporal Pyramid Transformer with Multimodal Interaction for Video
  Question Answering
Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
Min Peng
Chongyang Wang
Yuan Gao
Yu Shi
Xiangdong Zhou
181
4
0
10 Sep 2021
Bridge to Answer: Structure-aware Graph Interaction Network for Video
  Question Answering
Bridge to Answer: Structure-aware Graph Interaction Network for Video Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2021
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
431
110
0
29 Apr 2021
Temporal Query Networks for Fine-grained Video Understanding
Temporal Query Networks for Fine-grained Video UnderstandingComputer Vision and Pattern Recognition (CVPR), 2021
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
251
98
0
19 Apr 2021
On the hidden treasure of dialog in video question answering
On the hidden treasure of dialog in video question answeringIEEE International Conference on Computer Vision (ICCV), 2021
Deniz Engin
Franccois Schnitzler
Ngoc Q. K. Duong
Yannis Avrithis
226
12
0
26 Mar 2021
Structured Co-reference Graph Attention for Video-grounded Dialogue
Structured Co-reference Graph Attention for Video-grounded DialogueAAAI Conference on Artificial Intelligence (AAAI), 2021
Junyeong Kim
Sunjae Yoon
Dahyun Kim
Chang D. Yoo
202
30
0
24 Mar 2021
Multi-Modal Answer Validation for Knowledge-Based VQA
Multi-Modal Answer Validation for Knowledge-Based VQAAAAI Conference on Artificial Intelligence (AAAI), 2021
Jialin Wu
Jiasen Lu
Ashish Sabharwal
Roozbeh Mottaghi
369
166
0
23 Mar 2021
Semantic Grouping Network for Video Captioning
Semantic Grouping Network for Video CaptioningAAAI Conference on Artificial Intelligence (AAAI), 2021
Hobin Ryu
Sunghun Kang
Haeyong Kang
Chang D. Yoo
250
151
0
01 Feb 2021
Recent Advances in Video Question Answering: A Review of Datasets and
  Methods
Recent Advances in Video Question Answering: A Review of Datasets and Methods
Devshree Patel
Ratnam Parikh
Yesha Shastri
270
21
0
15 Jan 2021
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised
  Video Object Segmentation
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object SegmentationComputer Vision and Pattern Recognition (CVPR), 2020
Hyojin Park
Jayeon Yoo
Seohyeong Jeong
Ganesh Venkatesh
Nojun Kwak
VOS
367
43
0
21 Dec 2020
Trying Bilinear Pooling in Video-QA
Trying Bilinear Pooling in Video-QA
T. Winterbottom
S. Xiao
A. McLean
Noura Al Moubayed
207
4
0
18 Dec 2020
SCNet: Training Inference Sample Consistency for Instance Segmentation
SCNet: Training Inference Sample Consistency for Instance SegmentationAAAI Conference on Artificial Intelligence (AAAI), 2020
Thang Vu
Haeyong Kang
Chang D. Yoo
ISeg
317
106
0
18 Dec 2020
iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video
  Captioning and Video Question Answering
iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering
Vasu Sharma
Gurneet Arora
Navpreet Kaloty
196
39
0
16 Nov 2020
TTVOS: Lightweight Video Object Segmentation with Adaptive Template
  Attention Module and Temporal Consistency Loss
TTVOS: Lightweight Video Object Segmentation with Adaptive Template Attention Module and Temporal Consistency Loss
Hyojin Park
Ganesh Venkatesh
Nojun Kwak
VOS
214
5
0
09 Nov 2020
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual
  Question Answering
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question AnsweringFindings (Findings), 2020
Aisha Urooj Khan
Amir Mazaheri
N. Lobo
M. Shah
213
61
0
27 Oct 2020
Hierarchical Conditional Relation Networks for Multimodal Video Question
  Answering
Hierarchical Conditional Relation Networks for Multimodal Video Question AnsweringInternational Journal of Computer Vision (IJCV), 2020
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
BDL
342
28
0
18 Oct 2020
Self-supervised pre-training and contrastive representation learning for
  multiple-choice video QA
Self-supervised pre-training and contrastive representation learning for multiple-choice video QAAAAI Conference on Artificial Intelligence (AAAI), 2020
Seonhoon Kim
Seohyeong Jeong
Eunbyul Kim
Inho Kang
Nojun Kwak
SSL
284
43
0
17 Sep 2020
Knowledge-Based Video Question Answering with Unsupervised Scene
  Descriptions
Knowledge-Based Video Question Answering with Unsupervised Scene DescriptionsEuropean Conference on Computer Vision (ECCV), 2020
Noa Garcia
Yuta Nakashima
250
35
0
17 Jul 2020
PA-GAN: Progressive Attention Generative Adversarial Network for Facial
  Attribute Editing
PA-GAN: Progressive Attention Generative Adversarial Network for Facial Attribute Editing
Zhenliang He
Meina Kan
Jichao Zhang
Shiguang Shan
CVBMGAN
118
29
0
12 Jul 2020
Modality Shifting Attention Network for Multi-modal Video Question
  Answering
Modality Shifting Attention Network for Multi-modal Video Question Answering
Junyeong Kim
Minuk Ma
T. Pham
Kyungsu Kim
Chang D. Yoo
188
75
0
04 Jul 2020
Dense-Caption Matching and Frame-Selection Gating for Temporal
  Localization in VideoQA
Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA
Hyounghun Kim
Zineng Tang
Joey Tianyi Zhou
128
31
0
13 May 2020
Hierarchical Conditional Relation Networks for Video Question Answering
Hierarchical Conditional Relation Networks for Video Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2020
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
358
282
0
25 Feb 2020
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
Hung Le
Nancy F. Chen
132
10
0
25 Feb 2020
Neural Reasoning, Fast and Slow, for Video Question Answering
Neural Reasoning, Fast and Slow, for Video Question AnsweringIEEE International Joint Conference on Neural Network (IJCNN), 2019
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
188
14
0
10 Jul 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
214
253
0
25 Apr 2019
1