Action Genome: Actions as Composition of Spatio-temporal Scene Graphs

15 December 2019

Li Fei-Fei

Papers citing "Action Genome: Actions as Composition of Spatio-temporal Scene Graphs"

DyGEnc: Encoding a Sequence of Textual Scene Graphs to Reason and Answer Questions in Dynamic Scenes S. Linok Vadim Semenov Anastasia Trunova Oleg Bulichev Dmitry A. Yudin 37 0 0 06 May 2025
Large-scale Pre-training for Grounded Video Caption Generation Evangelos Kazakos Cordelia Schmid Josef Sivic 50 0 0 13 Mar 2025
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Miran Heo Min-Hung Chen De-An Huang Sifei Liu Subhashree Radhakrishnan Seon Joo Kim Yu-Chun Wang Ryo Hachiuma ObjD VLM 108 2 0 14 Jan 2025
Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding Aaron Lohner Francesco Compagno Jonathan M Francis A. Oltramari 55 2 0 10 Jan 2025
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions Xiaoyang Liu Boran Wen Xinpeng Liu Zizheng Zhou Hongwei Fan Cewu Lu Lizhuang Ma Yulong Chen Y. Li 46 2 0 27 Dec 2024
SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation Hang Zhang Zhuoling Li Jun Liu LRM 100 1 0 15 Dec 2024
HyperGLM: HyperGraph for Video Scene Graph Generation and Anticipation Trong-Thuan Nguyen Pha Nguyen J. Cothren Alper Yilmaz Khoa Luu 80 1 0 27 Nov 2024
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation Rohith Peddi Saurabh Ayush Abhay Shrivastava Parag Singla Vibhav Gogate 65 0 0 20 Nov 2024
Situational Scene Graph for Structured Human-centric Situation Understanding Chinthani Sugandhika Chen Li Deepu Rajan Basura Fernando 45 1 0 30 Oct 2024
Object-Attribute-Relation Representation Based Video Semantic Communication Qiyuan Du Yiping Duan Qianqian Yang Xiaoming Tao Mérouane Debbah 47 2 0 15 Jun 2024
OED: Towards One-stage End-to-End Dynamic Scene Graph Generation Guan-Bo Wang Zhiming Li Qingchao Chen Yang Liu 30 9 0 27 May 2024
STAR: A Benchmark for Situated Reasoning in Real-World Videos Bo Wu Shoubin Yu Zhenfang Chen Joshua B Tenenbaum Chuang Gan 31 176 0 15 May 2024
3VL: Using Trees to Improve Vision-Language Models' Interpretability Nir Yellinek Leonid Karlinsky Raja Giryes CoGe VLM 44 4 0 28 Dec 2023
Medical Image Classification Using Transfer Learning and Chaos Game Optimization on the Internet of Medical Things Alhassan Mabrouk Abdelghani Dahou M. A. Abd Elaziz R. D. Díaz Redondo Mohammed Kayed 19 19 0 12 Dec 2023
Action Scene Graphs for Long-Form Understanding of Egocentric Videos Ivan Rodin Antonino Furnari Kyle Min Subarna Tripathi G. Farinella EgoV 16 12 0 06 Dec 2023
HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding Trong-Thuan Nguyen Pha Nguyen Khoa Luu 10 12 0 05 Dec 2023
LEAP: LLM-Generation of Egocentric Action Programs Eadom Dessalene Michael Maynord Cornelia Fermuller Yiannis Aloimonos 11 3 0 29 Nov 2023
Multi Sentence Description of Complex Manipulation Action Videos Fatemeh Ziaeetabar Reza Safabakhsh S. Momtazi M. Tamosiunaite F. Worgotter 13 1 0 13 Nov 2023
Semantic and Expressive Variation in Image Captions Across Languages Andre Ye Sebastin Santy Jena D. Hwang Amy X. Zhang Ranjay Krishna VLM 35 3 0 22 Oct 2023
STUPD: A Synthetic Dataset for Spatial and Temporal Relation Reasoning Palaash Agrawal Haidi Azaman Cheston Tan 30 3 0 13 Sep 2023
AGS: An Dataset and Taxonomy for Domestic Scene Sound Event Recognition Nan Che Chenrui Liu Fei Yu 19 0 0 30 Aug 2023
Human-Object Interaction Prediction in Videos through Gaze Following Zhifan Ni Esteve Valls Mascaro Hyemin Ahn Dongheui Lee 16 10 0 06 Jun 2023
COLA: A Benchmark for Compositional Text-to-image Retrieval Arijit Ray Filip Radenovic Abhimanyu Dubey Bryan A. Plummer Ranjay Krishna Kate Saenko CoGe VLM 25 34 0 05 May 2023
Unbiased Scene Graph Generation in Videos Sayak Nag Kyle Min Subarna Tripathi A. Roy-Chowdhury 16 28 0 03 Apr 2023
SPAN: Learning Similarity between Scene Graphs and Images with Transformers Yuren Cong Wentong Liao Bodo Rosenhahn M. Yang 20 6 0 02 Apr 2023
Taking A Closer Look at Visual Relation: Unbiased Video Scene Graph Generation with Decoupled Label Learning Wenqing Wang Yawei Luo Zhiqin Chen Tao Jiang Lei Chen Yi Yang Jun Xiao 21 7 0 23 Mar 2023
DDS: Decoupled Dynamic Scene-Graph Generation Network A S M Iftekhar Raphael Ruschel Satish Kumar Suya You B. S. Manjunath 23 2 0 18 Jan 2023
A General Purpose Supervisory Signal for Embodied Agents Kunal Pratap Singh Jordi Salvador Luca Weihs Aniruddha Kembhavi SSL 16 3 0 01 Dec 2022
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions Yong-Lu Li Hongwei Fan Zuoyu Qiu Yiming Dou Liang Xu ... Peiyang Guo Haisheng Su Dongliang Wang Wei Yu Wu Cewu Lu 16 7 0 14 Nov 2022
Meta Spatio-Temporal Debiasing for Video Scene Graph Generation Li Xu Haoxuan Qu Jason Kuen Jiuxiang Gu Jun Liu CML 17 27 0 23 Jul 2022
Is an Object-Centric Video Representation Beneficial for Transfer? Chuhan Zhang Ankush Gupta Andrew Zisserman ViT 16 26 0 20 Jul 2022
Continuous Scene Representations for Embodied AI S. Gadre Kiana Ehsani Shuran Song Roozbeh Mottaghi 20 46 0 31 Mar 2022
How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs Hazel Doughty Cees G. M. Snoek 12 19 0 23 Mar 2022
4D-OR: Semantic Scene Graphs for OR Domain Modeling Ege Ozsoy Evin Pınar Örnek U. Eck Tobias Czempiel F. Tombari Nassir Navab 15 34 0 22 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection Jing Tan Yuhong Wang Gangshan Wu Limin Wang 39 14 0 01 Mar 2022
RelTR: Relation Transformer for Scene Graph Generation Yuren Cong M. Yang Bodo Rosenhahn ViT 75 130 0 27 Jan 2022
Cross-modal Contrastive Distillation for Instructional Activity Anticipation Zhengyuan Yang Jingen Liu Jing-ling Huang Xiaodong He Tao Mei Chenliang Xu Jiebo Luo 14 6 0 18 Jan 2022
TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning Yang Liu Keze Wang Lingbo Liu Hao Lan Liang Lin SSL AI4TS 37 113 0 07 Dec 2021
A Variational Graph Autoencoder for Manipulation Action Recognition and Prediction Gamze Akyol Sanem Sariel E. Aksoy GNN DRL BDL 19 2 0 25 Oct 2021
Image Generation from Scene Graphs Justin Johnson Agrim Gupta Li Fei-Fei GNN 208 809 0 04 Apr 2018