v1v2v3v4v5 (latest)

HAKE: Human Activity Knowledge Engine

13 April 2019

Papers citing "HAKE: Human Activity Knowledge Engine"

39 / 39 papers shown

Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-ModelsIEEE International Conference on Robotics and Automation (ICRA), 2025

370

17 Apr 2025

Interacted Object Grounding in Spatio-Temporal Human-Object InteractionsAAAI Conference on Artificial Intelligence (AAAI), 2024

521

27 Dec 2024

GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction

Patrick Kwon

Chen Chen

Hanbyul Joo

335

17 Oct 2024

NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality

Hao Yang

Ashwin Swaminathan

Colin Jon Taylor

213

18 Aug 2024

Open-World Human-Object Interaction Detection via Multi-modal Prompts

334

11 Jun 2024

Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View

Jin Wang

Ping Luo

288

27 May 2024

Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human Activity ReasoningNeural Information Processing Systems (NeurIPS), 2023

199

29 Nov 2023

Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real WorldInternational Conference on Learning Representations (ICLR), 2023

Xiaojian Ma

477

16 Oct 2023

A Grammatical Compositional Model for Video Action Detection

Ying Wu

300

04 Oct 2023

Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?

Liang Ding

Li Shen

318

24 Aug 2023

Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL ModelsNeural Information Processing Systems (NeurIPS), 2023

...

506

31 May 2023

Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language CompositionalityConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

476

23 May 2023

Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene GraphsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

371

10 May 2023

HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image GenerationIEEE International Conference on Computer Vision (ICCV), 2023

Ailing Zeng

Lei Zhang

285

133

09 Apr 2023

From Isolated Islands to Pangea: Unifying Semantic Space for Human Action UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023

...

537

02 Apr 2023

Going Beyond Nouns With Vision & Language Models Using Synthetic DataIEEE International Conference on Computer Vision (ICCV), 2023

Paola Cascante-Bonilla

...

498

30 Mar 2023

Beyond Object Recognition: A New Benchmark towards Object Concept LearningIEEE International Conference on Computer Vision (ICCV), 2022

428

06 Dec 2022

Teaching Structured Vision&Language Concepts to Vision&Language ModelsComputer Vision and Pattern Recognition (CVPR), 2022

...

394

21 Nov 2022

Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions

...

Haisheng Su

267

14 Nov 2022

Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI DetectionEuropean Conference on Computer Vision (ECCV), 2022

313

28 Jul 2022

VL-CheckList: Evaluating Pre-trained Vision-Language Models with Objects, Attributes and Relations

383

119

01 Jul 2022

Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object InteractionsComputer Vision and Pattern Recognition (CVPR), 2022

Xiaojian Ma

297

27 May 2022

RelViT: Concept-guided Vision Transformer for Visual Relational ReasoningInternational Conference on Learning Representations (ICLR), 2022

Xiaojian Ma

361

24 Apr 2022

Interactiveness Field in Human-Object InteractionsComputer Vision and Pattern Recognition (CVPR), 2022

270

16 Apr 2022

The Overlooked Classifier in Human-Object Interaction Recognition

Zicheng Liu

230

10 Mar 2022

Highlighting Object Category Immunity for the Generalization of Human-Object Interaction DetectionAAAI Conference on Artificial Intelligence (AAAI), 2022

Xinpeng Liu

Yong-Lu Li

Cewu Lu

257

19 Feb 2022

Multi-Modal Knowledge Graph Construction and Application: A SurveyIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022

Zhixu Li

304

256

11 Feb 2022

Is Object Detection Necessary for Human-Object Interaction Recognition?

Zicheng Liu

213

27 Jul 2021

DecAug: Augmenting HOI Detection via DecompositionAAAI Conference on Artificial Intelligence (AAAI), 2020

255

02 Oct 2020

DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction DetectionAAAI Conference on Artificial Intelligence (AAAI), 2020

354

02 Oct 2020

TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training ModelComputer Vision and Pattern Recognition (CVPR), 2020

240

269

10 Jun 2020

A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos

Graham Neubig

174

02 May 2020

Recursive Social Behavior Graph for Trajectory Prediction

241

185

22 Apr 2020

Experience Grounds Language

...

662

422

21 Apr 2020

Detailed 2D-3D Joint Representation for Human-Object InteractionComputer Vision and Pattern Recognition (CVPR), 2020

Jiefeng Li

237

155

17 Apr 2020

Symmetry and Group in Attribute-Object CompositionsComputer Vision and Pattern Recognition (CVPR), 2020

444

153

01 Apr 2020

KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human AnnotationsComputer Vision and Pattern Recognition (CVPR), 2020

641

28 Feb 2020

PIQA: Reasoning about Physical Commonsense in Natural LanguageAAAI Conference on Artificial Intelligence (AAAI), 2019

Yejin Choi

3.2K

2,818

26 Nov 2019

Three Branches: Detecting Actions With Richer Features

Jinchao Xia

Jiajun Tang

Cewu Lu

141

13 Aug 2019