ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.19972
  4. Cited By
HumanVLA: Towards Vision-Language Directed Object Rearrangement by
  Physical Humanoid

HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid

28 June 2024
Xinyu Xu
Yizheng Zhang
Yong-Lu Li
Lei Han
Cewu Lu
ArXivPDFHTML

Papers citing "HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid"

10 / 10 papers shown
Title
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
Liang Pan
Zeshi Yang
Zhiyang Dou
Wenjia Wang
Buzhen Huang
Bo Dai
Taku Komura
Jingbo Wang
45
1
0
25 Mar 2025
Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics
Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics
Zekai Deng
Ye-ling Shi
Kaiyang Ji
Lan Xu
Shaoli Huang
Jingya Wang
50
0
0
24 Mar 2025
Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions
Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions
Boran Wen
Dingbang Huang
Zichen Zhang
J. Zhou
Jianbin Deng
Jingyu Gong
Yulong Chen
Lizhuang Ma
Y. Li
3DH
47
0
0
20 Mar 2025
Human-Centric Foundation Models: Perception, Generation and Agentic Modeling
Human-Centric Foundation Models: Perception, Generation and Agentic Modeling
Shixiang Tang
Y. Wang
Lu Chen
Yuan Wang
Sida Peng
Dan Xu
W. Ouyang
VGen
125
2
0
12 Feb 2025
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion
  and Manipulation
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
Carmelo Sferrazza
Dun-Ming Huang
Xingyu Lin
Youngwoon Lee
Pieter Abbeel
37
30
0
15 Mar 2024
3D-VLA: A 3D Vision-Language-Action Generative World Model
3D-VLA: A 3D Vision-Language-Action Generative World Model
Haoyu Zhen
Xiaowen Qiu
Peihao Chen
Jincheng Yang
Xin Yan
Yilun Du
Yining Hong
Chuang Gan
LM&Ro
VGen
PINN
34
81
0
14 Mar 2024
Scaling Up Dynamic Human-Scene Interaction Modeling
Scaling Up Dynamic Human-Scene Interaction Modeling
Nan Jiang
Zhiyuan Zhang
Hongjie Li
Xiaoxuan Ma
Zan Wang
Yixin Chen
Tengyu Liu
Yixin Zhu
Siyuan Huang
30
51
0
13 Mar 2024
CALM: Conditional Adversarial Latent Models for Directable Virtual
  Characters
CALM: Conditional Adversarial Latent Models for Directable Virtual Characters
Chen Tessler
Yoni Kasten
Yunrong Guo
Shie Mannor
Gal Chechik
Xue Bin Peng
VGen
LM&Ro
31
72
0
02 May 2023
What Matters in Learning from Offline Human Demonstrations for Robot
  Manipulation
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
139
461
0
06 Aug 2021
PointNet: Deep Learning on Point Sets for 3D Classification and
  Segmentation
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas J. Guibas
3DH
3DPC
3DV
PINN
219
13,886
0
02 Dec 2016
1