Detailed 2D-3D Joint Representation for Human-Object Interaction

Computer Vision and Pattern Recognition (CVPR), 2020
17 April 2020
Yong-Lu Li
Xinpeng Liu
Han Lu
Shiyi Wang
Junqi Liu
Jiefeng Li
Cewu Lu
    3DH

Papers citing "Detailed 2D-3D Joint Representation for Human-Object Interaction"

50 / 91 papers shown
Title
End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer
Yonghui Yu
Jiahang Cai
Xun Wang
Wenwu Yang
ViT
57
0
0
17 Nov 2025
ChangingGrounding: 3D Visual Grounding in Changing Scenes
Miao Hu
Zhiwei Huang
Tai Wang
Jiangmiao Pang
Dahua Lin
Nanning Zheng
R. Xu
VGen
77
0
0
16 Oct 2025
MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions
Kaen Kogashi
Anoop Cherian
Meng-Yu Jennifer Kuo
108
0
0
09 Oct 2025
Person Identification from Egocentric Human-Object Interactions using 3D Hand Pose
Muhammad Hamza
Danish Hamid
Muhammad Tahir Akram
65
0
0
20 Sep 2025
No More Sibling Rivalry: Debiasing Human-Object Interaction Detection
Bin Yang
Yulin Zhang
Hong-Yu Zhou
Sibei Yang
128
0
0
31 Aug 2025
QueryCraft: Transformer-Guided Query Initialization for Enhanced Human-Object Interaction Detection
Yuxiao Wang
Wolin Liang
Yu Lei
Weiying Xue
Nan Zhuang
Qi Liu
72
0
0
12 Aug 2025
Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection
Francesco Tonini
Lorenzo Vaquero
Alessandro Conti
Cigdem Beyan
Elisa Ricci
103
0
0
23 Jul 2025
MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
Shoubin Yu
Yue Zhang
Ziyang Wang
Jaehong Yoon
Mohit Bansal
MoE, LRM
117
3
0
20 Jun 2025
An Image-like Diffusion Method for Human-Object Interaction Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Xiaofei Hui
Haoxuan Qu
Hossein Rahmani
Jun Liu
DiffM
283
1
0
23 Mar 2025
Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions
Computer Vision and Pattern Recognition (CVPR), 2025
Boran Wen
Dingbang Huang
Zichen Zhang
Jingren Zhou
Jianbin Deng
Jingyu Gong
Yulong Chen
Lizhuang Ma
Yongqian Li
3DH
300
4
0
20 Mar 2025
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaoyang Liu
Boran Wen
Xinpeng Liu
Zizheng Zhou
Hongwei Fan
Cewu Lu
Lizhuang Ma
Yulong Chen
Yongqian Li
388
3
0
27 Dec 2024
Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models
Neural Information Processing Systems (NeurIPS), 2024
Liulei Li
Wenguan Wang
Yue Yang
197
19
0
26 Oct 2024
A Review of Human-Object Interaction Detection
Yuxiao Wang
Qiwei Xiong
Yu Lei
Weiying Xue
Qi Liu
Zhenao Wei
195
8
0
20 Aug 2024
UAHOI: Uncertainty-aware Robust Interaction Learning for HOI Detection
Computer Vision and Image Understanding (CVIU), 2024
Mu Chen
Minghan Chen
Yi Yang
261
11
0
14 Aug 2024
An analysis of HOI: using a training-free method with multimodal visual foundation models when only the test set is available, without the training set
Chaoyi Ai
VLM
212
0
0
11 Aug 2024
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
European Conference on Computer Vision (ECCV), 2024
Ting Lei
Shaofeng Yin
Yuxin Peng
Yang Liu
VLM
265
20
0
05 Aug 2024
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Jie Yang
Bingliang Li
Ailing Zeng
L. Zhang
Ruimao Zhang
VLM
230
29
0
11 Jun 2024
A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Daizong Liu
Yang Liu
Wencan Huang
Wei Hu
LM&Ro
305
29
0
09 Jun 2024
Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning
Hang Zhang
Wenxiao Zhang
Haoxuan Qu
Jun Liu
224
10
0
15 Mar 2024
FreeA: Human-object Interaction Detection using Free Annotation Labels
Qi Liu
Yuxiao Wang
Xinyu Jiang
Yu Lei
Zhenao Wei
Yu Lei
Nan Zhuang
Weiying Xue
VLM
229
1
0
04 Mar 2024
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
Shoubin Yu
Jaehong Yoon
Mohit Bansal
400
14
0
08 Feb 2024
Primitive-based 3D Human-Object Interaction Modelling and Programming
Siqi Liu
Yong-Lu Li
Zhou Fang
Xinpeng Liu
Yang You
Cewu Lu
203
8
0
17 Dec 2023
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
Computer Vision and Pattern Recognition (CVPR), 2023
Yuhang Yang
Wei Zhai
Hongcheng Luo
Yang Cao
Zheng-Jun Zha
264
39
0
14 Dec 2023
Revisit Human-Scene Interaction via Space Occupancy
European Conference on Computer Vision (ECCV), 2023
Xinpeng Liu
Haowen Hou
Yanchao Yang
Yong-Lu Li
Cewu Lu
338
20
0
05 Dec 2023
Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection
Xubin Zhong
Changxing Ding
Yupeng Hu
Dacheng Tao
175
0
0
04 Dec 2023
Generating Human-Centric Visual Cues for Human-Object Interaction Detection via Large Vision-Language Models
Yu-Wei Zhan
Fan Liu
Xin Luo
Liqiang Nie
Xin-Shun Xu
Mohan S. Kankanhalli
VLM
197
0
0
26 Nov 2023
Neural-Logic Human-Object Interaction Detection
Liulei Li
Jianan Wei
Wenguan Wang
Yi Yang
229
39
0
16 Nov 2023
Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models
Neural Information Processing Systems (NeurIPS), 2023
Yichao Cao
Qingfei Tang
Xiu Su
Chen Song
Shan You
Xiaobo Lu
Chang Xu
232
42
0
07 Nov 2023
Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Bo Wan
Tinne Tuytelaars
VLM
265
6
0
10 Sep 2023
Agglomerative Transformer for Human-Object Interaction Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Danyang Tu
Wei Sun
Guangtao Zhai
Wei Shen
ViT
189
16
0
16 Aug 2023
Re-mine, Learn and Reason: Exploring the Cross-modal Semantic Correlations for Language-guided HOI detection
IEEE International Conference on Computer Vision (ICCV), 2023
Yichao Cao
Qingfei Tang
Fengyuan Yang
Xiu Su
Shan You
Xiaobo Lu
Chang Xu
247
25
0
25 Jul 2023
Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection
Guangzhi Wang
Yangyang Guo
Mohan S. Kankanhalli
265
0
0
19 Jul 2023
Focusing on what to decode and what to train: Efficient Training with HOI Split Decoders and Specific Target Guided DeNoising
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Junwen Chen
Yingchen Wang
Keiji Yanai
358
0
0
05 Jul 2023
Exploiting Multimodal Synthetic Data for Egocentric Human-Object Interaction Detection in an Industrial Scenario
Computer Vision and Image Understanding (CVIU), 2023
Rosario Leonardi
Francesco Ragusa
Antonino Furnari
G. Farinella
210
17
0
21 Jun 2023
Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model
Jie Yang
Bing Li
Fengyu Yang
Ailing Zeng
Lei Zhang
Ruimao Zhang
VLM, DiffM
246
29
0
20 May 2023
From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding
Computer Vision and Pattern Recognition (CVPR), 2023
Yong-Lu Li
Xiaoqian Wu
Xinpeng Liu
Zehao Wang
Yiming Dou
...
Junyi Zhang
Yixing Li
Jingru Tan
Xudong Lu
Cewu Lu
370
19
0
02 Apr 2023
HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2023
Sha Ning
Longtian Qiu
Yongfei Liu
Xuming He
VLM
317
69
0
28 Mar 2023
PSVT: End-to-End Multi-person 3D Pose and Shape Estimation with Progressive Video Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
Zhongwei Qiu
Qiansheng Yang
Jian Wang
Haocheng Feng
Junyu Han
Errui Ding
Chang-hui Xu
Dongmei Fu
Jingdong Wang
ViT
181
41
0
16 Mar 2023
Unified Visual Relationship Detection with Vision and Language Models
IEEE International Conference on Computer Vision (ICCV), 2023
Long Zhao
Liangzhe Yuan
Boqing Gong
Huayu Chen
Florian Schroff
Ming-Hsuan Yang
Hartwig Adam
Ting Liu
ObjD
242
12
0
16 Mar 2023
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning
International Conference on Learning Representations (ICLR), 2023
Bo Wan
Yongfei Liu
Desen Zhou
Tinne Tuytelaars
Xuming He
111
16
0
02 Mar 2023
Self-Supervised Category-Level Articulated Object Pose Estimation with Part-Level SE(3) Equivariance
International Conference on Learning Representations (ICLR), 2023
Xueyi Liu
Ji Zhang
Ruizhen Hu
Haibin Huang
He Wang
Li Yi
3DPC
230
29
0
28 Feb 2023
Parallel Reasoning Network for Human-Object Interaction Detection
Huan Peng
Fenggang Liu
Yangguang Li
Bin Huang
Jing Shao
Nong Sang
Changxin Gao
279
8
0
09 Jan 2023
Full-Body Articulated Human-Object Interaction
IEEE International Conference on Computer Vision (ICCV), 2022
Nan Jiang
Tengyu Liu
Zhexuan Cao
Jieming Cui
Zhiyuan Zhang
Yixin Chen
Heng Wang
Yixin Zhu
Siyuan Huang
320
67
0
20 Dec 2022
Beyond Object Recognition: A New Benchmark towards Object Concept Learning
IEEE International Conference on Computer Vision (ICCV), 2022
Yong-Lu Li
Yue Xu
Xinyu Xu
Xiaohan Mao
Yuan Yao
Siqi Liu
Cewu Lu
OCL
329
10
0
06 Dec 2022
Weakly-supervised Pre-training for 3D Human Pose Estimation via Perspective Knowledge
Pattern Recognition (Pattern Recogn.), 2022
Zhongwei Qiu
Kai Qiu
Jianlong Fu
Dongmei Fu
3DH
126
31
0
22 Nov 2022
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions
Yong-Lu Li
Hongwei Fan
Zuoyu Qiu
Yiming Dou
Liang Xu
...
Peiyang Guo
Haisheng Su
Dongliang Wang
Wei Wu
Cewu Lu
149
7
0
14 Nov 2022
D&D: Learning Human Dynamics from Dynamic Camera
European Conference on Computer Vision (ECCV), 2022
Jiefeng Li
Siyuan Bian
Chaoshun Xu
Gang Liu
Gang Yu
Cewu Lu
173
45
0
19 Sep 2022
IVT: An End-to-End Instance-guided Video Transformer for 3D Pose Estimation
ACM Multimedia (ACM MM), 2022
Zhongwei Qiu
Qiansheng Yang
Jian Wang
Dongmei Fu
ViT
176
12
0
06 Aug 2022
Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection
European Conference on Computer Vision (ECCV), 2022
Xiaoqian Wu
Yong-Lu Li
Xinpeng Liu
Junyi Zhang
Yuzhe Wu
Cewu Lu
211
47
0
28 Jul 2022
Live Stream Temporally Embedded 3D Human Body Pose and Shape Estimation
Zhouping Wang
Sarah Ostadabbas
3DH
141
6
0
25 Jul 2022