ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.06539
  4. Cited By
HAKE: Human Activity Knowledge Engine
v1v2v3v4v5 (latest)

HAKE: Human Activity Knowledge Engine

13 April 2019
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Mingyang Chen
Ze Ma
Shiyi Wang
Haoshu Fang
Cewu Lu
    HAI
ArXiv (abs)PDFHTML

Papers citing "HAKE: Human Activity Knowledge Engine"

39 / 39 papers shown
Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-Models
Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-ModelsIEEE International Conference on Robotics and Automation (ICRA), 2025
Chen Wang
Fei Xia
Wenhao Yu
Tingnan Zhang
Ruohan Zhang
Ce Liu
Li Fei-Fei
Jie Tan
Jacky Liang
370
7
0
17 Apr 2025
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
Interacted Object Grounding in Spatio-Temporal Human-Object InteractionsAAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaoyang Liu
Boran Wen
Xinpeng Liu
Zizheng Zhou
Hongwei Fan
Cewu Lu
Lizhuang Ma
Yulong Chen
Yongqian Li
521
6
0
27 Dec 2024
GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction
GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction
Patrick Kwon
Chen Chen
Hanbyul Joo
335
7
0
17 Oct 2024
NAVERO: Unlocking Fine-Grained Semantics for Video-Language
  Compositionality
NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality
Chaofan Tao
Gukyeong Kwon
Varad Gunjal
Hao Yang
Zhaowei Cai
Yonatan Dukler
Ashwin Swaminathan
R. Manmatha
Colin Jon Taylor
Stefano Soatto
CoGe
213
0
0
18 Aug 2024
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Jie Yang
Bingliang Li
Ailing Zeng
L. Zhang
Ruimao Zhang
VLM
334
39
0
11 Jun 2024
Diagnosing the Compositional Knowledge of Vision Language Models from a
  Game-Theoretic View
Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Jin Wang
Shichao Dong
Yapeng Zhu
Kelu Yao
Weidong Zhao
Chao Li
Ping Luo
CoGeLRM
288
6
0
27 May 2024
Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human
  Activity Reasoning
Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human Activity ReasoningNeural Information Processing Systems (NeurIPS), 2023
Xiaoqian Wu
Yong-Lu Li
Jianhua Sun
Cewu Lu
199
33
0
29 Nov 2023
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in
  the Real World
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real WorldInternational Conference on Learning Representations (ICLR), 2023
Rujie Wu
Xiaojian Ma
Zhenliang Zhang
Wei Wang
Qing Li
Song-Chun Zhu
Yizhou Wang
LRMVLM
477
18
0
16 Oct 2023
A Grammatical Compositional Model for Video Action Detection
A Grammatical Compositional Model for Video Action Detection
Zhijun Zhang
Xu Zou
Jiahuan Zhou
Sheng Zhong
Ying Wu
300
0
0
04 Oct 2023
Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language
  Pretraining?
Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?
Haiwei Yang
Liang Ding
Jun Rao
Ye Liu
Li Shen
Changxing Ding
318
27
0
24 Aug 2023
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL
  Models
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL ModelsNeural Information Processing Systems (NeurIPS), 2023
Sivan Doveh
Assaf Arbelle
Sivan Harary
Roei Herzig
Donghyun Kim
...
Yikang Shen
Raja Giryes
Rogerio Feris
S. Ullman
Leonid Karlinsky
VLMCoGe
506
80
0
31 May 2023
Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for
  Improved Vision-Language Compositionality
Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language CompositionalityConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Harman Singh
Pengchuan Zhang
Qifan Wang
Mengjiao MJ Wang
Wenhan Xiong
Jingfei Du
Yu Chen
CoGeVLM
476
37
0
23 May 2023
Incorporating Structured Representations into Pretrained Vision &
  Language Models Using Scene Graphs
Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene GraphsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Roei Herzig
Alon Mendelson
Leonid Karlinsky
Assaf Arbelle
Rogerio Feris
Trevor Darrell
Amir Globerson
VLM
371
41
0
10 May 2023
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image
  Generation
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image GenerationIEEE International Conference on Computer Vision (ICCV), 2023
Xu Ju
Ailing Zeng
Chenchen Zhao
Jianan Wang
Lei Zhang
Qian Xu
DiffM
285
133
0
09 Apr 2023
From Isolated Islands to Pangea: Unifying Semantic Space for Human
  Action Understanding
From Isolated Islands to Pangea: Unifying Semantic Space for Human Action UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023
Yong-Lu Li
Xiaoqian Wu
Xinpeng Liu
Zehao Wang
Yiming Dou
...
Junyi Zhang
Yixing Li
Jingru Tan
Xudong Lu
Cewu Lu
537
19
0
02 Apr 2023
Going Beyond Nouns With Vision & Language Models Using Synthetic Data
Going Beyond Nouns With Vision & Language Models Using Synthetic DataIEEE International Conference on Computer Vision (ICCV), 2023
Paola Cascante-Bonilla
Khaled Shehada
James Smith
Sivan Doveh
Donghyun Kim
...
Gül Varol
A. Oliva
Vicente Ordonez
Rogerio Feris
Leonid Karlinsky
VLMSyDa
498
49
0
30 Mar 2023
Beyond Object Recognition: A New Benchmark towards Object Concept
  Learning
Beyond Object Recognition: A New Benchmark towards Object Concept LearningIEEE International Conference on Computer Vision (ICCV), 2022
Yong-Lu Li
Yue Xu
Xinyu Xu
Xiaohan Mao
Yuan Yao
Siqi Liu
Cewu Lu
OCL
428
11
0
06 Dec 2022
Teaching Structured Vision&Language Concepts to Vision&Language Models
Teaching Structured Vision&Language Concepts to Vision&Language ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Sivan Doveh
Assaf Arbelle
Sivan Harary
Yikang Shen
Roei Herzig
...
Donghyun Kim
Raja Giryes
Rogerio Feris
S. Ullman
Leonid Karlinsky
VLMCoGe
394
95
0
21 Nov 2022
Discovering A Variety of Objects in Spatio-Temporal Human-Object
  Interactions
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions
Yong-Lu Li
Hongwei Fan
Zuoyu Qiu
Yiming Dou
Liang Xu
...
Peiyang Guo
Haisheng Su
Dongliang Wang
Wei Wu
Cewu Lu
267
8
0
14 Nov 2022
Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI
  Detection
Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI DetectionEuropean Conference on Computer Vision (ECCV), 2022
Xiaoqian Wu
Yong-Lu Li
Xinpeng Liu
Junyi Zhang
Yuzhe Wu
Cewu Lu
313
50
0
28 Jul 2022
VL-CheckList: Evaluating Pre-trained Vision-Language Models with
  Objects, Attributes and Relations
VL-CheckList: Evaluating Pre-trained Vision-Language Models with Objects, Attributes and Relations
Tiancheng Zhao
Tianqi Zhang
Mingwei Zhu
Haozhan Shen
Kyusong Lee
Xiaopeng Lu
Jianwei Yin
VLMCoGeMLLM
383
119
0
01 Jul 2022
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object
  Interactions
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object InteractionsComputer Vision and Pattern Recognition (CVPR), 2022
Huaizu Jiang
Xiaojian Ma
Weili Nie
Zhiding Yu
Yuke Zhu
Song-Chun Zhu
Anima Anandkumar
VLM
297
50
0
27 May 2022
RelViT: Concept-guided Vision Transformer for Visual Relational
  Reasoning
RelViT: Concept-guided Vision Transformer for Visual Relational ReasoningInternational Conference on Learning Representations (ICLR), 2022
Xiaojian Ma
Weili Nie
Zhiding Yu
Huaizu Jiang
Chaowei Xiao
Yuke Zhu
Song-Chun Zhu
Anima Anandkumar
ViTLRM
361
20
0
24 Apr 2022
Interactiveness Field in Human-Object Interactions
Interactiveness Field in Human-Object InteractionsComputer Vision and Pattern Recognition (CVPR), 2022
Xinpeng Liu
Yong-Lu Li
Xiaoqian Wu
Yu-Wing Tai
Cewu Lu
Chi-Keung Tang
270
62
0
16 Apr 2022
The Overlooked Classifier in Human-Object Interaction Recognition
The Overlooked Classifier in Human-Object Interaction Recognition
Ying Jin
Yinpeng Chen
Lijuan Wang
Jianfeng Wang
Pei Yu
Lin Liang
Lei Li
Zicheng Liu
VLM
230
10
0
10 Mar 2022
Highlighting Object Category Immunity for the Generalization of
  Human-Object Interaction Detection
Highlighting Object Category Immunity for the Generalization of Human-Object Interaction DetectionAAAI Conference on Artificial Intelligence (AAAI), 2022
Xinpeng Liu
Yong-Lu Li
Cewu Lu
257
16
0
19 Feb 2022
Multi-Modal Knowledge Graph Construction and Application: A Survey
Multi-Modal Knowledge Graph Construction and Application: A SurveyIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Xiangru Zhu
Zhixu Li
Xiaodan Wang
Xueyao Jiang
Yixiang Chen
Xuwu Wang
Yanghua Xiao
N. Yuan
304
256
0
11 Feb 2022
Is Object Detection Necessary for Human-Object Interaction Recognition?
Is Object Detection Necessary for Human-Object Interaction Recognition?
Ying Jin
Yinpeng Chen
Lijuan Wang
Jianfeng Wang
Pei Yu
Zicheng Liu
Lei Li
213
7
0
27 Jul 2021
DecAug: Augmenting HOI Detection via Decomposition
DecAug: Augmenting HOI Detection via DecompositionAAAI Conference on Artificial Intelligence (AAAI), 2020
Yichen Xie
Haoshu Fang
Dian Shao
Yong-Lu Li
Cewu Lu
255
10
0
02 Oct 2020
DIRV: Dense Interaction Region Voting for End-to-End Human-Object
  Interaction Detection
DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction DetectionAAAI Conference on Artificial Intelligence (AAAI), 2020
Haoshu Fang
Yichen Xie
Dian Shao
Cewu Lu
354
65
0
02 Oct 2020
TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training
  Model
TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training ModelComputer Vision and Pattern Recognition (CVPR), 2020
Bo Pang
Yizhuo Li
Yifan Zhang
Muchen Li
Cewu Lu
VOT
240
269
0
10 Jun 2020
A Benchmark for Structured Procedural Knowledge Extraction from Cooking
  Videos
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos
Frank F. Xu
Lei Ji
Ding Wang
Junyi Du
Graham Neubig
Yonatan Bisk
Nan Duan
174
22
0
02 May 2020
Recursive Social Behavior Graph for Trajectory Prediction
Recursive Social Behavior Graph for Trajectory Prediction
Jianhua Sun
Qinhong Jiang
Cewu Lu
GNN
241
185
0
22 Apr 2020
Experience Grounds Language
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
662
422
0
21 Apr 2020
Detailed 2D-3D Joint Representation for Human-Object Interaction
Detailed 2D-3D Joint Representation for Human-Object InteractionComputer Vision and Pattern Recognition (CVPR), 2020
Yong-Lu Li
Xinpeng Liu
Han Lu
Shiyi Wang
Junqi Liu
Jiefeng Li
Cewu Lu
3DH
237
155
0
17 Apr 2020
Symmetry and Group in Attribute-Object Compositions
Symmetry and Group in Attribute-Object CompositionsComputer Vision and Pattern Recognition (CVPR), 2020
Yong-Lu Li
Yue Xu
Xiaohan Mao
Cewu Lu
444
153
0
01 Apr 2020
KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous
  Human Annotations
KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human AnnotationsComputer Vision and Pattern Recognition (CVPR), 2020
Yang You
Yujing Lou
Chengkun Li
Zhoujun Cheng
Liangwei Li
Lizhuang Ma
Weiming Wang
Cewu Lu
3DH3DV3DPC
641
87
0
28 Feb 2020
PIQA: Reasoning about Physical Commonsense in Natural Language
PIQA: Reasoning about Physical Commonsense in Natural LanguageAAAI Conference on Artificial Intelligence (AAAI), 2019
Yonatan Bisk
Rowan Zellers
Ronan Le Bras
Jianfeng Gao
Yejin Choi
OODLRM
3.2K
2,818
0
26 Nov 2019
Three Branches: Detecting Actions With Richer Features
Three Branches: Detecting Actions With Richer Features
Jinchao Xia
Jiajun Tang
Cewu Lu
141
8
0
13 Aug 2019
1
Page 1 of 1