ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.10906
  4. Cited By
Motion-Appearance Co-Memory Networks for Video Question Answering

Motion-Appearance Co-Memory Networks for Video Question Answering

29 March 2018
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
ArXivPDFHTML

Papers citing "Motion-Appearance Co-Memory Networks for Video Question Answering"

18 / 118 papers shown
Title
Noise Estimation Using Density Estimation for Self-Supervised Multimodal
  Learning
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning
Elad Amrani
Rami Ben-Ari
Daniel Rotman
A. Bronstein
9
121
0
06 Mar 2020
Hierarchical Conditional Relation Networks for Video Question Answering
Hierarchical Conditional Relation Networks for Video Question Answering
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
14
258
0
25 Feb 2020
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
Hung Le
Nancy F. Chen
17
9
0
25 Feb 2020
Action Modifiers: Learning from Adverbs in Instructional Videos
Action Modifiers: Learning from Adverbs in Instructional Videos
Hazel Doughty
Ivan Laptev
W. Mayol-Cuevas
Dima Damen
10
30
0
13 Dec 2019
Entropy-Enhanced Multimodal Attention Model for Scene-Aware Dialogue
  Generation
Entropy-Enhanced Multimodal Attention Model for Scene-Aware Dialogue Generation
Kuan-Yen Lin
Chao-Chun Hsu
Yun-Nung (Vivian) Chen
Lun-Wei Ku
VGen
11
20
0
22 Aug 2019
Learning Question-Guided Video Representation for Multi-Turn Video
  Question Answering
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Guan-Lin Chao
Abhinav Rastogi
Semih Yavuz
Dilek Z. Hakkani-Tür
Jindong Chen
Ian Lane
8
6
0
31 Jul 2019
Two-stream Spatiotemporal Feature for Video QA Task
Two-stream Spatiotemporal Feature for Video QA Task
Chiwan Song
Woobin Im
Sung-eui Yoon
14
0
0
11 Jul 2019
Neural Reasoning, Fast and Slow, for Video Question Answering
Neural Reasoning, Fast and Slow, for Video Question Answering
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
6
14
0
10 Jul 2019
Open-Ended Long-Form Video Question Answering via Hierarchical
  Convolutional Self-Attention Networks
Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks
Zhu Zhang
Zhou Zhao
Zhijie Lin
Jingkuan Song
Xiaofei He
BDL
11
14
0
28 Jun 2019
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video
Zhenfang Chen
Lin Ma
Wenhan Luo
Kwan-Yee Kenneth Wong
10
101
0
06 Jun 2019
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via
  Question Answering
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering
Zhou Yu
D. Xu
Jun-chen Yu
Ting Yu
Zhou Zhao
Yueting Zhuang
Dacheng Tao
6
433
0
06 Jun 2019
Gaining Extra Supervision via Multi-task learning for Multi-Modal Video
  Question Answering
Gaining Extra Supervision via Multi-task learning for Multi-Modal Video Question Answering
Junyeong Kim
Minuk Ma
Kyungsu Kim
Sungjin Kim
Chang-Dong Yoo
13
27
0
28 May 2019
Memory-Augmented Temporal Dynamic Learning for Action Recognition
Memory-Augmented Temporal Dynamic Learning for Action Recognition
Yuan. Yuan
Dong Wang
Qi. Wang
17
13
0
30 Apr 2019
Progressive Attention Memory Network for Movie Story Question Answering
Progressive Attention Memory Network for Movie Story Question Answering
Junyeong Kim
Minuk Ma
Kyungsu Kim
Sungjin Kim
Chang-Dong Yoo
11
75
0
18 Apr 2019
Heterogeneous Memory Enhanced Multimodal Attention Model for Video
  Question Answering
Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering
Chenyou Fan
Xiaofan Zhang
Shu Zhang
Wensheng Wang
Chi Zhang
Heng-Chiao Huang
11
276
0
08 Apr 2019
CTAP: Complementary Temporal Action Proposal Generation
CTAP: Complementary Temporal Action Proposal Generation
J. Gao
Kan Chen
Ram Nevatia
ViT
6
178
0
12 Jul 2018
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
Kan Chen
J. Gao
Ram Nevatia
22
89
0
11 Mar 2018
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
144
1,465
0
06 Jun 2016
Previous
123