ResearchTrend.AI
MovieGraphs: Towards Understanding Human-Centric Situations from Videos


19 December 2017
Paul Vicol
Makarand Tapaswi
Lluis Castrejon
Sanja Fidler

Papers citing "MovieGraphs: Towards Understanding Human-Centric Situations from Videos"

50 / 70 papers shown
AsyReC: A Multimodal Graph-based Framework for Spatio-Temporal Asymmetric Dyadic Relationship Classification
Wang Tang
Fethiye Irmak Dogan
Linbo Qing
Hatice Gunes
35
0
0
07 Apr 2025
Towards Online Multi-Modal Social Interaction Understanding
X. Li
Shijian Deng
Bolin Lai
Weiguo Pian
James M. Rehg
Yapeng Tian
46
0
0
25 Mar 2025
Modality-Aware Shot Relating and Comparing for Video Scene Detection
Jiawei Tan
Hongxing Wang
Kang Dang
Jiaxin Li
Zhilong Ou
33
0
0
23 Dec 2024
Generating Event-oriented Attribution for Movies via Two-Stage Prefix-Enhanced Multimodal LLM
Yuanjie Lyu
Tong Bill Xu
Zihan Niu
Bo Peng
Jing Ke
Enhong Chen
23
0
0
14 Sep 2024
Towards Social AI: A Survey on Understanding Social Interactions
Sangmin Lee
Minzhi Li
Bolin Lai
Wenqi Jia
Fiona Ryan
...
Ozgur Kara
Bikram Boote
Weiyan Shi
Diyi Yang
James M. Rehg
39
4
0
05 Sep 2024
Learning Video Context as Interleaved Multimodal Sequences
S. Shao
Pengchuan Zhang
Y. Li
Xide Xia
A. Meso
Ziteng Gao
Jinheng Xie
N. Holliman
Mike Zheng Shou
43
5
0
31 Jul 2024
VideoClusterNet: Self-Supervised and Adaptive Clustering For Videos
Devesh Walawalkar
Pablo Garrido
CVBM
41
0
0
16 Jul 2024
A Survey of Video Datasets for Grounded Event Understanding
Kate Sanders
Benjamin Van Durme
32
4
0
14 Jun 2024
From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition
Shiwei Wu
Chao Zhang
Joya Chen
Tong Bill Xu
Likang Wu
Yao Hu
Enhong Chen
17
0
0
12 Jun 2024
"Previously on ..." From Recaps to Story Summarization
Aditya Kumar Singh
Dhruv Srivastava
Makarand Tapaswi
40
0
0
19 May 2024
JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups
Simindokht Jahangard
Zhixi Cai
Shiki Wen
Hamid Rezatofighi
26
6
0
06 Apr 2024
Visual Objectification in Films: Towards a New AI Task for Video Interpretation
Julie Tores
L. Sassatelli
Hui-Yin Wu
Clement Bergman
Lea Andolfi
...
F. Precioso
Thierry Devars
Magali Guaresi
Virginie Julliard
Sarah Lecossais
35
2
0
24 Jan 2024
FiGCLIP: Fine-Grained CLIP Adaptation via Densely Annotated Videos
S. DarshanSingh
Zeeshan Khan
Makarand Tapaswi
VLM
CLIP
30
3
0
15 Jan 2024
SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models
Lee Hyun
Kim Sung-Bin
Seungju Han
Youngjae Yu
Tae-Hyun Oh
27
13
0
15 Dec 2023
MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding
Hongjie Zhang
Yi Liu
Lu Dong
Yifei Huang
Z. Ling
Yali Wang
Limin Wang
Yu Qiao
23
25
0
08 Dec 2023
SPOT! Revisiting Video-Language Models for Event Understanding
Gengyuan Zhang
Jinhe Bi
Jindong Gu
Yanyu Chen
Volker Tresp
19
2
0
21 Nov 2023
Long-range Multimodal Pretraining for Movie Understanding
Dawit Mureja Argaw
Joon-Young Lee
Markus Woodson
In So Kweon
Fabian Caba Heilbron
VLM
25
7
0
18 Aug 2023
EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding
K. Mangalam
Raiymbek Akshulakov
Jitendra Malik
25
245
0
17 Aug 2023
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Enxin Song
Wenhao Chai
Guanhong Wang
Yucheng Zhang
Haoyang Zhou
...
Tianbo Ye
Yanting Zhang
Yang Lu
Jenq-Neng Hwang
Gaoang Wang
VLM
MLLM
22
260
0
31 Jul 2023
Synthesizing Event-centric Knowledge Graphs of Daily Activities Using Virtual Space
S. Egami
Takanori Ugai
Mikiko Oono
K. Kitamura
Ken Fukuda
20
11
0
30 Jul 2023
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning
Jianghui Wang
Yuxuan Wang
Dongyan Zhao
Zilong Zheng
41
1
0
04 Jun 2023
NormBank: A Knowledge Bank of Situational Social Norms
Caleb Ziems
Jane Dwivedi-Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
18
41
0
26 May 2023
Learning Emotion Representations from Verbal and Nonverbal Communication
Sitao Zhang
Yimu Pan
J. Z. Wang
VLM
66
21
0
22 May 2023
How you feelin'? Learning Emotions and Mental States in Movie Scenes
D. Srivastava
A. Singh
Makarand Tapaswi
32
10
0
12 Apr 2023
Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Bei Gan
Xiujun Shu
Ruizhi Qiao
Haoqian Wu
Keyun Chen
Hanjun Li
Bohan Ren
26
5
0
26 Mar 2023
Multimodal Subtask Graph Generation from Instructional Videos
Y. Jang
Sungryull Sohn
Lajanugen Logeswaran
Tiange Luo
Moontae Lee
Ho Hin Lee
23
9
0
17 Feb 2023
Doubly Right Object Recognition: A Why Prompt for Visual Rationales
Chengzhi Mao
Revant Teotia
Amrutha Sundar
Sachit Menon
Junfeng Yang
Xin Eric Wang
Carl Vondrick
13
29
0
12 Dec 2022
MovieCLIP: Visual Scene Recognition in Movies
Digbalay Bose
Rajat Hebbar
Krishna Somandepalli
Haoyang Zhang
Yin Cui
K. Cole-McLaughlin
H. Wang
Shrikanth Narayanan
CLIP
12
20
0
20 Oct 2022
Match Cutting: Finding Cuts with Smooth Visual Transitions
Boris Chen
Amir Ziai
Rebecca Tucker
Yuchen Xie
VGen
23
14
0
11 Oct 2022
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Teruko Mitamura
Alexander G. Hauptmann
16
34
0
18 Aug 2022
Self-Contained Entity Discovery from Captioned Videos
M. Ayoughi
P. Mettes
Paul T. Groth
26
2
0
13 Aug 2022
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos
Juncheng Billy Li
Junlin Xie
Linchao Zhu
Long Qian
Siliang Tang
...
Haochen Shi
Shengyu Zhang
Longhui Wei
Qi Tian
Yueting Zhuang
32
12
0
03 Aug 2022
Hierarchical Self-supervised Representation Learning for Movie Understanding
Fanyi Xiao
Kaustav Kundu
Joseph Tighe
Davide Modolo
SSL
37
24
0
06 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
38
101
0
04 Apr 2022
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding
Yidan Sun
Qin Chao
Yangfeng Ji
Boyang Albert Li
VGen
33
10
0
11 Mar 2022
DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points
Zhengfei Kuang
Jiaman Li
Mingming He
Tong Wang
Yajie Zhao
14
16
0
13 Dec 2021
Feature Generation for Long-tail Classification
Rahul Vigneswaran
M. Law
V. Balasubramanian
Makarand Tapaswi
VLM
19
16
0
10 Nov 2021
HighlightMe: Detecting Highlights from Human-Centric Videos
Uttaran Bhattacharya
Gang Wu
Stefano Petrangeli
Viswanathan Swaminathan
Dinesh Manocha
29
10
0
05 Oct 2021
Pairwise Emotional Relationship Recognition in Drama Videos: Dataset and Benchmark
Xun Gao
Yin Zhao
Jie Zhang
Longjun Cai
19
6
0
23 Sep 2021
MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
Alejandro Pardo
Fabian Caba Heilbron
Juan Carlos León Alcázar
Ali K. Thabet
Bernard Ghanem
VGen
29
28
0
12 Sep 2021
OSCAR-Net: Object-centric Scene Graph Attention for Image Attribution
Eric N. D. Nguyen
Tu Bui
Vishy Swaminathan
John Collomosse
13
15
0
07 Aug 2021
Use of Affective Visual Information for Summarization of Human-Centric Videos
Berkay Köprü
E. Erzin
AI4TS
22
6
0
08 Jul 2021
Towards Long-Form Video Understanding
Chao-Yuan Wu
Philipp Krähenbühl
VLM
ViT
36
165
0
21 Jun 2021
SocAoG: Incremental Graph Parsing for Social Relation Inference in Dialogues
Liang Qiu
Yuan Liang
Yizhou Zhao
Pan Lu
Baolin Peng
Zhou Yu
Ying Nian Wu
Song-Chun Zhu
32
17
0
02 Jun 2021
Face, Body, Voice: Video Person-Clustering with Multiple Modalities
Andrew Brown
Vicky Kalogeiton
Andrew Zisserman
CVBM
20
30
0
20 May 2021
Visual Semantic Role Labeling for Video Understanding
Arka Sadhu
Tanmay Gupta
Mark Yatskar
Ram Nevatia
Aniruddha Kembhavi
VLM
20
68
0
02 Apr 2021
Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality
Trisha Mittal
Puneet Mathur
Aniket Bera
Dinesh Manocha
CVBM
14
37
0
11 Mar 2021
Understanding in Artificial Intelligence
S. Maetschke
D. M. Iraola
Pieter Barnard
Elaheh Shafieibavani
Peter Zhong
Ying Xu
Antonio Jimeno Yepes
ELM
VLM
11
0
0
17 Jan 2021
Robust Character Labeling in Movie Videos: Data Resources and Self-supervised Feature Adaptation
Krishna Somandepalli
Rajat Hebbar
Shrikanth Narayanan
CVBM
24
5
0
25 Aug 2020
Graph Wasserstein Correlation Analysis for Movie Retrieval
Xueyao Zhang
Tong Zhang
Xiaobin Hong
Zhen Cui
Jian Yang
14
2
0
06 Aug 2020