Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1904.06539
Cited By
v1
v2
v3
v4
v5 (latest)
HAKE: Human Activity Knowledge Engine
13 April 2019
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Mingyang Chen
Ze Ma
Shiyi Wang
Haoshu Fang
Cewu Lu
HAI
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"HAKE: Human Activity Knowledge Engine"
39 / 39 papers shown
Title
Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-Models
IEEE International Conference on Robotics and Automation (ICRA), 2025
Chen Wang
Fei Xia
Wenhao Yu
Tingnan Zhang
Ruohan Zhang
Ce Liu
Li Fei-Fei
Jie Tan
Jacky Liang
183
7
0
17 Apr 2025
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaoyang Liu
Boran Wen
Xinpeng Liu
Zizheng Zhou
Hongwei Fan
Cewu Lu
Lizhuang Ma
Yulong Chen
Yongqian Li
338
3
0
27 Dec 2024
GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction
Patrick Kwon
Chen Chen
Hanbyul Joo
193
6
0
17 Oct 2024
NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality
Chaofan Tao
Gukyeong Kwon
Varad Gunjal
Hao Yang
Zhaowei Cai
Yonatan Dukler
Ashwin Swaminathan
R. Manmatha
Colin Jon Taylor
Stefano Soatto
CoGe
125
0
0
18 Aug 2024
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Jie Yang
Bingliang Li
Ailing Zeng
L. Zhang
Ruimao Zhang
VLM
178
28
0
11 Jun 2024
Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Jin Wang
Shichao Dong
Yapeng Zhu
Kelu Yao
Weidong Zhao
Chao Li
Ping Luo
CoGe
LRM
205
5
0
27 May 2024
Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human Activity Reasoning
Neural Information Processing Systems (NeurIPS), 2023
Xiaoqian Wu
Yong-Lu Li
Jianhua Sun
Cewu Lu
134
28
0
29 Nov 2023
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World
International Conference on Learning Representations (ICLR), 2023
Rujie Wu
Xiaojian Ma
Zhenliang Zhang
Wei Wang
Qing Li
Song-Chun Zhu
Yizhou Wang
LRM
VLM
247
14
0
16 Oct 2023
A Grammatical Compositional Model for Video Action Detection
Zhijun Zhang
Xu Zou
Jiahuan Zhou
Sheng Zhong
Ying Wu
168
0
0
04 Oct 2023
Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?
Haiwei Yang
Liang Ding
Jun Rao
Ye Liu
Li Shen
Changxing Ding
150
22
0
24 Aug 2023
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models
Neural Information Processing Systems (NeurIPS), 2023
Sivan Doveh
Assaf Arbelle
Sivan Harary
Roei Herzig
Donghyun Kim
...
Yikang Shen
Raja Giryes
Rogerio Feris
S. Ullman
Leonid Karlinsky
VLM
CoGe
309
73
0
31 May 2023
Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Harman Singh
Pengchuan Zhang
Qifan Wang
Mengjiao MJ Wang
Wenhan Xiong
Jingfei Du
Yu Chen
CoGe
VLM
248
31
0
23 May 2023
Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Roei Herzig
Alon Mendelson
Leonid Karlinsky
Assaf Arbelle
Rogerio Feris
Trevor Darrell
Amir Globerson
VLM
207
38
0
10 May 2023
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation
IEEE International Conference on Computer Vision (ICCV), 2023
Xu Ju
Ailing Zeng
Chenchen Zhao
Jianan Wang
Lei Zhang
Qian Xu
DiffM
153
121
0
09 Apr 2023
From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding
Computer Vision and Pattern Recognition (CVPR), 2023
Yong-Lu Li
Xiaoqian Wu
Xinpeng Liu
Zehao Wang
Yiming Dou
...
Junyi Zhang
Yixing Li
Jingru Tan
Xudong Lu
Cewu Lu
286
18
0
02 Apr 2023
Going Beyond Nouns With Vision & Language Models Using Synthetic Data
IEEE International Conference on Computer Vision (ICCV), 2023
Paola Cascante-Bonilla
Khaled Shehada
James Smith
Sivan Doveh
Donghyun Kim
...
Gül Varol
A. Oliva
Vicente Ordonez
Rogerio Feris
Leonid Karlinsky
VLM
SyDa
330
48
0
30 Mar 2023
Beyond Object Recognition: A New Benchmark towards Object Concept Learning
IEEE International Conference on Computer Vision (ICCV), 2022
Yong-Lu Li
Yue Xu
Xinyu Xu
Xiaohan Mao
Yuan Yao
Siqi Liu
Cewu Lu
OCL
257
10
0
06 Dec 2022
Teaching Structured Vision&Language Concepts to Vision&Language Models
Computer Vision and Pattern Recognition (CVPR), 2022
Sivan Doveh
Assaf Arbelle
Sivan Harary
Yikang Shen
Roei Herzig
...
Donghyun Kim
Raja Giryes
Rogerio Feris
S. Ullman
Leonid Karlinsky
VLM
CoGe
252
87
0
21 Nov 2022
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions
Yong-Lu Li
Hongwei Fan
Zuoyu Qiu
Yiming Dou
Liang Xu
...
Peiyang Guo
Haisheng Su
Dongliang Wang
Wei Wu
Cewu Lu
137
7
0
14 Nov 2022
Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection
European Conference on Computer Vision (ECCV), 2022
Xiaoqian Wu
Yong-Lu Li
Xinpeng Liu
Junyi Zhang
Yuzhe Wu
Cewu Lu
163
47
0
28 Jul 2022
VL-CheckList: Evaluating Pre-trained Vision-Language Models with Objects, Attributes and Relations
Tiancheng Zhao
Tianqi Zhang
Mingwei Zhu
Haozhan Shen
Kyusong Lee
Xiaopeng Lu
Jianwei Yin
VLM
CoGe
MLLM
230
109
0
01 Jul 2022
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions
Computer Vision and Pattern Recognition (CVPR), 2022
Huaizu Jiang
Xiaojian Ma
Weili Nie
Zhiding Yu
Yuke Zhu
Song-Chun Zhu
Anima Anandkumar
VLM
161
47
0
27 May 2022
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
International Conference on Learning Representations (ICLR), 2022
Xiaojian Ma
Weili Nie
Zhiding Yu
Huaizu Jiang
Chaowei Xiao
Yuke Zhu
Song-Chun Zhu
Anima Anandkumar
ViT
LRM
224
20
0
24 Apr 2022
Interactiveness Field in Human-Object Interactions
Computer Vision and Pattern Recognition (CVPR), 2022
Xinpeng Liu
Yong-Lu Li
Xiaoqian Wu
Yu-Wing Tai
Cewu Lu
Chi-Keung Tang
132
56
0
16 Apr 2022
The Overlooked Classifier in Human-Object Interaction Recognition
Ying Jin
Yinpeng Chen
Lijuan Wang
Jianfeng Wang
Pei Yu
Lin Liang
Lei Li
Zicheng Liu
VLM
146
10
0
10 Mar 2022
Highlighting Object Category Immunity for the Generalization of Human-Object Interaction Detection
AAAI Conference on Artificial Intelligence (AAAI), 2022
Xinpeng Liu
Yong-Lu Li
Cewu Lu
141
16
0
19 Feb 2022
Multi-Modal Knowledge Graph Construction and Application: A Survey
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Xiangru Zhu
Zhixu Li
Xiaodan Wang
Xueyao Jiang
Yixiang Chen
Xuwu Wang
Yanghua Xiao
N. Yuan
144
220
0
11 Feb 2022
Is Object Detection Necessary for Human-Object Interaction Recognition?
Ying Jin
Yinpeng Chen
Lijuan Wang
Jianfeng Wang
Pei Yu
Zicheng Liu
Lei Li
106
7
0
27 Jul 2021
DecAug: Augmenting HOI Detection via Decomposition
AAAI Conference on Artificial Intelligence (AAAI), 2020
Yichen Xie
Haoshu Fang
Dian Shao
Yong-Lu Li
Cewu Lu
118
10
0
02 Oct 2020
DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection
AAAI Conference on Artificial Intelligence (AAAI), 2020
Haoshu Fang
Yichen Xie
Dian Shao
Cewu Lu
199
62
0
02 Oct 2020
TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model
Computer Vision and Pattern Recognition (CVPR), 2020
Bo Pang
Yizhuo Li
Yifan Zhang
Muchen Li
Cewu Lu
VOT
139
263
0
10 Jun 2020
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos
Frank F. Xu
Lei Ji
Botian Shi
Junyi Du
Graham Neubig
Yonatan Bisk
Nan Duan
87
21
0
02 May 2020
Recursive Social Behavior Graph for Trajectory Prediction
Jianhua Sun
Qinhong Jiang
Cewu Lu
GNN
118
174
0
22 Apr 2020
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
351
387
0
21 Apr 2020
Detailed 2D-3D Joint Representation for Human-Object Interaction
Computer Vision and Pattern Recognition (CVPR), 2020
Yong-Lu Li
Xinpeng Liu
Han Lu
Shiyi Wang
Junqi Liu
Jiefeng Li
Cewu Lu
3DH
133
149
0
17 Apr 2020
Symmetry and Group in Attribute-Object Compositions
Computer Vision and Pattern Recognition (CVPR), 2020
Yong-Lu Li
Yue Xu
Xiaohan Mao
Cewu Lu
200
139
0
01 Apr 2020
KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human Annotations
Computer Vision and Pattern Recognition (CVPR), 2020
Yang You
Yujing Lou
Chengkun Li
Zhoujun Cheng
Liangwei Li
Lizhuang Ma
Weiming Wang
Cewu Lu
3DH
3DV
3DPC
313
77
0
28 Feb 2020
PIQA: Reasoning about Physical Commonsense in Natural Language
AAAI Conference on Artificial Intelligence (AAAI), 2019
Yonatan Bisk
Rowan Zellers
Ronan Le Bras
Jianfeng Gao
Yejin Choi
OOD
LRM
775
2,345
0
26 Nov 2019
Three Branches: Detecting Actions With Richer Features
Jinchao Xia
Jiajun Tang
Cewu Lu
79
8
0
13 Aug 2019
1