ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.08607
  4. Cited By
Progressive Attention Memory Network for Movie Story Question Answering

Progressive Attention Memory Network for Movie Story Question Answering

18 April 2019
Junyeong Kim
Minuk Ma
Kyungsu Kim
Sungjin Kim
Chang D. Yoo
ArXiv (abs)PDFHTML

Papers citing "Progressive Attention Memory Network for Movie Story Question Answering"

34 / 34 papers shown
Cross-Modal Reasoning with Event Correlation for Video Question
  Answering
Cross-Modal Reasoning with Event Correlation for Video Question Answering
Chengxiang Yin
Zhengping Che
Kun Wu
Zhiyuan Xu
Qinru Qiu
Jian Tang
192
0
0
20 Dec 2023
Learning Fine-Grained Visual Understanding for Video Question Answering
  via Decoupling Spatial-Temporal Modeling
Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal ModelingBritish Machine Vision Conference (BMVC), 2022
Hsin-Ying Lee
Hung-Ting Su
Bing-Chen Tsai
Tsung-Han Wu
Jia-Fong Yeh
Winston H. Hsu
312
2
0
08 Oct 2022
Frame-Subtitle Self-Supervision for Multi-Modal Video Question Answering
Frame-Subtitle Self-Supervision for Multi-Modal Video Question Answering
Jiong Wang
Zhou Zhao
Weike Jin
139
0
0
08 Sep 2022
Clover: Towards A Unified Video-Language Alignment and Fusion Model
Clover: Towards A Unified Video-Language Alignment and Fusion ModelComputer Vision and Pattern Recognition (CVPR), 2022
Jingjia Huang
Yinan Li
Jiashi Feng
Xinglong Wu
Xiaoshuai Sun
Rongrong Ji
VLM
283
56
0
16 Jul 2022
From Representation to Reasoning: Towards both Evidence and Commonsense
  Reasoning for Video Question-Answering
From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-AnsweringComputer Vision and Pattern Recognition (CVPR), 2022
Jiangtong Li
Li Niu
Liqing Zhang
193
66
0
30 May 2022
Multilevel Hierarchical Network with Multiscale Sampling for Video
  Question Answering
Multilevel Hierarchical Network with Multiscale Sampling for Video Question AnsweringInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Min Peng
Chongyang Wang
Yuan Gao
Yu Shi
Xiang-Dong Zhou
181
29
0
09 May 2022
All in One: Exploring Unified Video-Language Pre-training
All in One: Exploring Unified Video-Language Pre-trainingComputer Vision and Pattern Recognition (CVPR), 2022
Alex Jinpeng Wang
Yixiao Ge
Rui Yan
Yuying Ge
Xudong Lin
Guanyu Cai
Jianping Wu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
313
237
0
14 Mar 2022
Video Question Answering: Datasets, Algorithms and Challenges
Video Question Answering: Datasets, Algorithms and ChallengesConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yaoyao Zhong
Junbin Xiao
Wei Ji
Yicong Li
Wei Deng
Tat-Seng Chua
332
115
0
02 Mar 2022
NEWSKVQA: Knowledge-Aware News Video Question Answering
NEWSKVQA: Knowledge-Aware News Video Question AnsweringPacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2022
Pranay Gupta
Manish Gupta
257
9
0
08 Feb 2022
Temporal Pyramid Transformer with Multimodal Interaction for Video
  Question Answering
Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
Min Peng
Chongyang Wang
Yuan Gao
Yu Shi
Xiangdong Zhou
184
4
0
10 Sep 2021
Bridge to Answer: Structure-aware Graph Interaction Network for Video
  Question Answering
Bridge to Answer: Structure-aware Graph Interaction Network for Video Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2021
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
438
111
0
29 Apr 2021
Temporal Query Networks for Fine-grained Video Understanding
Temporal Query Networks for Fine-grained Video UnderstandingComputer Vision and Pattern Recognition (CVPR), 2021
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
272
99
0
19 Apr 2021
On the hidden treasure of dialog in video question answering
On the hidden treasure of dialog in video question answeringIEEE International Conference on Computer Vision (ICCV), 2021
Deniz Engin
Franccois Schnitzler
Ngoc Q. K. Duong
Yannis Avrithis
238
12
0
26 Mar 2021
Structured Co-reference Graph Attention for Video-grounded Dialogue
Structured Co-reference Graph Attention for Video-grounded DialogueAAAI Conference on Artificial Intelligence (AAAI), 2021
Junyeong Kim
Sunjae Yoon
Dahyun Kim
Chang D. Yoo
203
30
0
24 Mar 2021
Multi-Modal Answer Validation for Knowledge-Based VQA
Multi-Modal Answer Validation for Knowledge-Based VQAAAAI Conference on Artificial Intelligence (AAAI), 2021
Jialin Wu
Jiasen Lu
Ashish Sabharwal
Roozbeh Mottaghi
418
168
0
23 Mar 2021
Semantic Grouping Network for Video Captioning
Semantic Grouping Network for Video CaptioningAAAI Conference on Artificial Intelligence (AAAI), 2021
Hobin Ryu
Sunghun Kang
Haeyong Kang
Chang D. Yoo
261
151
0
01 Feb 2021
Recent Advances in Video Question Answering: A Review of Datasets and
  Methods
Recent Advances in Video Question Answering: A Review of Datasets and Methods
Devshree Patel
Ratnam Parikh
Yesha Shastri
277
21
0
15 Jan 2021
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised
  Video Object Segmentation
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object SegmentationComputer Vision and Pattern Recognition (CVPR), 2020
Hyojin Park
Jayeon Yoo
Seohyeong Jeong
Ganesh Venkatesh
Nojun Kwak
VOS
373
43
0
21 Dec 2020
Trying Bilinear Pooling in Video-QA
Trying Bilinear Pooling in Video-QA
T. Winterbottom
S. Xiao
A. McLean
Noura Al Moubayed
211
4
0
18 Dec 2020
SCNet: Training Inference Sample Consistency for Instance Segmentation
SCNet: Training Inference Sample Consistency for Instance SegmentationAAAI Conference on Artificial Intelligence (AAAI), 2020
Thang Vu
Haeyong Kang
Chang D. Yoo
ISeg
317
106
0
18 Dec 2020
iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video
  Captioning and Video Question Answering
iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering
Vasu Sharma
Gurneet Arora
Navpreet Kaloty
201
39
0
16 Nov 2020
TTVOS: Lightweight Video Object Segmentation with Adaptive Template
  Attention Module and Temporal Consistency Loss
TTVOS: Lightweight Video Object Segmentation with Adaptive Template Attention Module and Temporal Consistency Loss
Hyojin Park
Ganesh Venkatesh
Nojun Kwak
VOS
214
5
0
09 Nov 2020
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual
  Question Answering
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question AnsweringFindings (Findings), 2020
Aisha Urooj Khan
Amir Mazaheri
N. Lobo
M. Shah
216
62
0
27 Oct 2020
Hierarchical Conditional Relation Networks for Multimodal Video Question
  Answering
Hierarchical Conditional Relation Networks for Multimodal Video Question AnsweringInternational Journal of Computer Vision (IJCV), 2020
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
BDL
379
28
0
18 Oct 2020
Self-supervised pre-training and contrastive representation learning for
  multiple-choice video QA
Self-supervised pre-training and contrastive representation learning for multiple-choice video QAAAAI Conference on Artificial Intelligence (AAAI), 2020
Seonhoon Kim
Seohyeong Jeong
Eunbyul Kim
Inho Kang
Nojun Kwak
SSL
287
43
0
17 Sep 2020
Knowledge-Based Video Question Answering with Unsupervised Scene
  Descriptions
Knowledge-Based Video Question Answering with Unsupervised Scene DescriptionsEuropean Conference on Computer Vision (ECCV), 2020
Noa Garcia
Yuta Nakashima
250
35
0
17 Jul 2020
PA-GAN: Progressive Attention Generative Adversarial Network for Facial
  Attribute Editing
PA-GAN: Progressive Attention Generative Adversarial Network for Facial Attribute Editing
Zhenliang He
Meina Kan
Jichao Zhang
Shiguang Shan
CVBMGAN
127
29
0
12 Jul 2020
Modality Shifting Attention Network for Multi-modal Video Question
  Answering
Modality Shifting Attention Network for Multi-modal Video Question Answering
Junyeong Kim
Minuk Ma
T. Pham
Kyungsu Kim
Chang D. Yoo
199
75
0
04 Jul 2020
Dense-Caption Matching and Frame-Selection Gating for Temporal
  Localization in VideoQA
Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA
Hyounghun Kim
Zineng Tang
Joey Tianyi Zhou
140
31
0
13 May 2020
Character Matters: Video Story Understanding with Character-Aware
  Relations
Character Matters: Video Story Understanding with Character-Aware Relations
Shijie Geng
Ji Zhang
Zuohui Fu
Shiyang Feng
Hang Zhang
Gerard de Melo
231
11
0
09 May 2020
Hierarchical Conditional Relation Networks for Video Question Answering
Hierarchical Conditional Relation Networks for Video Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2020
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
396
284
0
25 Feb 2020
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
Hung Le
Nancy F. Chen
146
10
0
25 Feb 2020
Neural Reasoning, Fast and Slow, for Video Question Answering
Neural Reasoning, Fast and Slow, for Video Question AnsweringIEEE International Joint Conference on Neural Network (IJCNN), 2019
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
198
14
0
10 Jul 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
224
256
0
25 Apr 2019
1
Page 1 of 1