ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.09631
  4. Cited By
3D-VLA: A 3D Vision-Language-Action Generative World Model

3D-VLA: A 3D Vision-Language-Action Generative World Model

International Conference on Machine Learning (ICML), 2024
14 March 2024
Haoyu Zhen
Xiaowen Qiu
Peihao Chen
Jincheng Yang
Xin Yan
Yilun Du
Yining Hong
Chuang Gan
    LM&RoVGenPINN
ArXiv (abs)PDFHTMLHuggingFace (10 upvotes)

Papers citing "3D-VLA: A 3D Vision-Language-Action Generative World Model"

50 / 141 papers shown
VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling
VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling
Weiqi Li
Quande Zhang
Ruifeng Zhai
Liang Lin
Guangrun Wang
194
1
0
02 Dec 2025
LISA-3D: Lifting Language-Image Segmentation to 3D via Multi-View Consistency
Zhongbin Guo
Jiahe Liu
Wenyu Gao
Yushan Li
Chengzhi Li
Ping Jian
71
0
0
30 Nov 2025
SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead
SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead
Chaojun Ni
Cheng Chen
Xiaofeng Wang
Zheng Zhu
Wenzhao Zheng
...
Qiang Zhang
Yun Ye
Yang Wang
Guan Huang
Wenjun Mei
105
0
0
30 Nov 2025
IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation
IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation
Y. Li
Lichi Li
Anh Dao
Xinyu Zhou
Yicheng Qiao
...
Daeun Lee
Z. Chen
Zhen Tan
Mohit Bansal
Yu Kong
156
0
0
21 Nov 2025
RynnVLA-002: A Unified Vision-Language-Action and World Model
RynnVLA-002: A Unified Vision-Language-Action and World Model
Jun Cen
Siteng Huang
Yuqian Yuan
Kehan Li
Hangjie Yuan
...
Xin Li
Hao Luo
Fan Wang
Deli Zhao
H. Chen
VGenSyDa
317
0
0
21 Nov 2025
VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation
VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation
Hanyu Zhou
Chuanhao Ma
Gim Hee Lee
191
0
0
21 Nov 2025
BridgeEQA: Virtual Embodied Agents for Real Bridge Inspections
BridgeEQA: Virtual Embodied Agents for Real Bridge Inspections
Subin Varghese
Joshua Gao
Asad Ur Rahman
Vedhus Hoskere
160
0
0
16 Nov 2025
A Step Toward World Models: A Survey on Robotic Manipulation
A Step Toward World Models: A Survey on Robotic Manipulation
Peng-Fei Zhang
Ying Cheng
Xiaofan Sun
S. Wang
Lei Zhu
Lei Zhu
Heng Tao Shen
LM&Ro
745
2
0
31 Oct 2025
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks
Xu Zheng
Zihao Dongfang
Lutao Jiang
Boyuan Zheng
Yulong Guo
...
L. Zhang
Danda Pani Paudel
Nicu Sebe
Luc Van Gool
Xuming Hu
LRMVLM
712
4
0
29 Oct 2025
From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors
From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors
Z. Zhang
Hao Li
Yalun Dai
Zhengbang Zhu
Lei Zhou
...
S. Chen
Ziwei Liu
Y. Liu
Xinghang Li
Pan Zhou
105
1
0
20 Oct 2025
QDepth-VLA: Quantized Depth Prediction as Auxiliary Supervision for Vision-Language-Action Models
QDepth-VLA: Quantized Depth Prediction as Auxiliary Supervision for Vision-Language-Action Models
Y. Li
Yihao Chen
Mingcai Zhou
Haoran Li
Zhengtao Zhang
Dongbin Zhao
VLM
114
1
0
16 Oct 2025
DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning
DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning
Tianyuan Yuan
Yicheng Liu
Chenhao Lu
Zhuoguang Chen
Tao Jiang
Hang Zhao
VLM
121
0
0
15 Oct 2025
HiMaCon: Discovering Hierarchical Manipulation Concepts from Unlabeled Multi-Modal Data
HiMaCon: Discovering Hierarchical Manipulation Concepts from Unlabeled Multi-Modal Data
Ruizhe Liu
Pei Zhou
Qian Luo
Li Sun
Jun Cen
Yibing Song
Yanchao Yang
SSL
398
0
0
13 Oct 2025
X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model
X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model
Jinliang Zheng
Jianxiong Li
Zhihao Wang
Dongxiu Liu
Xirui Kang
...
Ya-Qin Zhang
Jiangmiao Pang
Jingjing Liu
Tai Wang
Xianyuan Zhan
LM&Ro
232
12
0
11 Oct 2025
VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation
VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation
Shaoqi Dong
Chaoyou Fu
Haihan Gao
Y. Zhang
Chi Yan
...
H. Cao
Yang Gao
Xing Sun
Ran He
Caifeng Shan
VLM
164
1
0
10 Oct 2025
Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications
Vision-Language-Action Models for Robotics: A Review Towards Real-World ApplicationsIEEE Access (IEEE Access), 2025
Kento Kawaharazuka
Jihoon Oh
Jun Yamada
Ingmar Posner
Yuke Zhu
LM&Ro
261
24
0
08 Oct 2025
Avi: Action from Volumetric Inference
Avi: Action from Volumetric Inference
Harris Song
Long Le
VGenLM&Ro
116
0
0
07 Oct 2025
NoTVLA: Narrowing of Dense Action Trajectories for Generalizable Robot Manipulation
NoTVLA: Narrowing of Dense Action Trajectories for Generalizable Robot Manipulation
Zheng Huang
Mingyu Liu
Xiaoyi Lin
Huanyi Zheng
Canyu Zhao
...
Xiaoman Li
Yiduo Jia
Hao Zhong
Hao Chen
Chunhua Shen
105
1
0
04 Oct 2025
MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
Zhuoyang Liu
Jiaming Liu
Jiadong Xu
Nuowei Han
Chenyang Gu
...
Kai Chin Hsieh
K. Wu
Zhengping Che
Yong Dai
Shanghang Zhang
LM&Ro
124
4
0
30 Sep 2025
dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought
dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought
Junjie Wen
Minjie Zhu
Jiaming Liu
Zhiyuan Liu
Yicun Yang
Linfeng Zhang
Shanghang Zhang
Yichen Zhu
Yi Xu
124
3
0
30 Sep 2025
Transferring Vision-Language-Action Models to Industry Applications: Architectures, Performance, and Challenges
Transferring Vision-Language-Action Models to Industry Applications: Architectures, Performance, and Challenges
Shuai Li
Chen Yizhe
Li Dong
Liu Sichao
Lan Dapeng
Liu Yu
Zhibo Pang
LM&Ro
117
1
0
27 Sep 2025
MoWM: Mixture-of-World-Models for Embodied Planning via Latent-to-Pixel Feature Modulation
MoWM: Mixture-of-World-Models for Embodied Planning via Latent-to-Pixel Feature Modulation
Yu Shang
Yangcheng Yu
Xin Zhang
Xin Jin
Haisheng Su
Wei Wu
Yong Li
VGen
171
1
0
26 Sep 2025
Pixel Motion Diffusion is What We Need for Robot Control
Pixel Motion Diffusion is What We Need for Robot Control
E-Ro Nguyen
Y. Zhang
Kanchana Ranasinghe
Xiang Li
Michael S. Ryoo
DiffM
140
0
0
26 Sep 2025
Generalist Robot Manipulation beyond Action Labeled Data
Generalist Robot Manipulation beyond Action Labeled Data
Alexander Spiridonov
Jan-Nico Zaech
Nikolay Nikolov
Luc Van Gool
Danda Pani Paudel
124
1
0
24 Sep 2025
Pure Vision Language Action (VLA) Models: A Comprehensive Survey
Pure Vision Language Action (VLA) Models: A Comprehensive Survey
Dapeng Zhang
Jin Sun
Chenghui Hu
Xiaoyan Wu
Zhenlong Yuan
R. Zhou
Fei Shen
Qingguo Zhou
LM&Ro
295
15
0
23 Sep 2025
VLA-LPAF: Lightweight Perspective-Adaptive Fusion for Vision-Language-Action to Enable More Unconstrained Robotic Manipulation
VLA-LPAF: Lightweight Perspective-Adaptive Fusion for Vision-Language-Action to Enable More Unconstrained Robotic Manipulation
Jinyue Bian
Zhaoxing Zhang
Zhengyu Liang
Shiwei Zheng
Shengtao Zhang
Rong Shen
Chen Yang
Anzhou Hou
124
0
0
18 Sep 2025
CRAFT: Coaching Reinforcement Learning Autonomously using Foundation Models for Multi-Robot Coordination Tasks
CRAFT: Coaching Reinforcement Learning Autonomously using Foundation Models for Multi-Robot Coordination Tasks
Seoyeon Choi
Kanghyun Ryu
Jonghoon Ock
Negar Mehr
176
0
0
17 Sep 2025
Maps for Autonomous Driving: Full-process Survey and Frontiers
Maps for Autonomous Driving: Full-process Survey and Frontiers
Pengxin Chen
Zhipeng Luo
Xiaoqi Jiang
Zhangcai Yin
Jonathan Li
136
0
0
16 Sep 2025
Igniting VLMs toward the Embodied Space
Igniting VLMs toward the Embodied Space
Andy Zhai
B. Liu
Bruno Fang
Chalse Cai
Ellie Ma
...
Shalfun Li
Starrick Liu
S. Chen
Vincent Chen
Zach Xu
LM&RoVLM
195
9
0
15 Sep 2025
RoboChemist: Long-Horizon and Safety-Compliant Robotic Chemical Experimentation
RoboChemist: Long-Horizon and Safety-Compliant Robotic Chemical Experimentation
Z. Zhang
Chenghao Yue
Haobo Xu
Minwen Liao
Xianglin Qi
Huan-ang Gao
Ziwei Wang
Hang Zhao
148
1
0
10 Sep 2025
RoboMatch: A Unified Mobile-Manipulation Teleoperation Platform with Auto-Matching Network Architecture for Long-Horizon Tasks
RoboMatch: A Unified Mobile-Manipulation Teleoperation Platform with Auto-Matching Network Architecture for Long-Horizon Tasks
Hanyu Liu
Yunsheng Ma
Jiaxin Huang
Keqiang Ren
Jiayi Wen
...
Pan Li
Jiejun Hou
Haoru Luan
Zhihua Wang
Zhigong Song
148
0
0
10 Sep 2025
U-ARM : Ultra low-cost general teleoperation interface for robot manipulation
U-ARM : Ultra low-cost general teleoperation interface for robot manipulation
Yanwen Zou
Zhaoye Zhou
Chenyang Shi
Zewei Ye
Junda Huang
Yan Ding
Bo Zhao
200
0
0
02 Sep 2025
Planning with Reasoning using Vision Language World Model
Planning with Reasoning using Vision Language World Model
Delong Chen
Theo Moutakanni
Willy Chung
Yejin Bang
Ziwei Ji
Allen Bolourchi
Pascale Fung
VGenVLM
262
9
0
02 Sep 2025
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots
Minghuan Liu
Zhengbang Zhu
Xiaoshen Han
Peng Hu
Haotong Lin
...
Xinghang Li
Yong Yu
Weinan Zhang
Tao Kong
Bingyi Kang
129
2
0
02 Sep 2025
Robotic Manipulation via Imitation Learning: Taxonomy, Evolution, Benchmark, and Challenges
Robotic Manipulation via Imitation Learning: Taxonomy, Evolution, Benchmark, and Challenges
Zezeng Li
Alexandre Chapin
Enda Xiang
Rui Yang
Bruno Machado
Na Lei
Emmanuel Dellandrea
Di Huang
Liming Chen
255
3
0
24 Aug 2025
Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning
Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning
Yijun Liu
Yuwei Liu
Yuan Meng
J. Zhang
Yuwei Zhou
...
Jiacheng Jiang
Kangye Ji
Shijia Ge
Zhi Wang
Wenwu Zhu
97
1
0
21 Aug 2025
Survey of Vision-Language-Action Models for Embodied Manipulation
Survey of Vision-Language-Action Models for Embodied Manipulation
Haoran Li
Yuhui Chen
Wenbo Cui
Weiheng Liu
Kai Liu
Mingcai Zhou
Zhengtao Zhang
Dongbin Zhao
LM&Ro
466
4
0
21 Aug 2025
Grounding Actions in Camera Space: Observation-Centric Vision-Language-Action Policy
Grounding Actions in Camera Space: Observation-Centric Vision-Language-Action Policy
Tianyi Zhang
Haonan Duan
Haoran Hao
Yu Qiao
Jifeng Dai
Zhi Hou
137
3
0
18 Aug 2025
Large VLM-based Vision-Language-Action Models for Robotic Manipulation: A Survey
Large VLM-based Vision-Language-Action Models for Robotic Manipulation: A Survey
Rui Shao
W. Li
Lingsen Zhang
Renshan Zhang
Zhiyang Liu
Ran Chen
Liqiang Nie
LM&Ro
247
26
0
18 Aug 2025
OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
T. Zemskova
A. Staroverov
Dmitry A. Yudin
Aleksandr I. Panov
103
0
0
15 Aug 2025
ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver
ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver
Wenxuan Song
Ziyang Zhou
Han Zhao
Jiayi Chen
Pengxiang Ding
Haodong Yan
Yuxin Huang
Feilong Tang
Xuetao Zhang
Haoang Li
LM&Ro
136
11
0
14 Aug 2025
Large Model Empowered Embodied AI: A Survey on Decision-Making and Embodied Learning
Large Model Empowered Embodied AI: A Survey on Decision-Making and Embodied Learning
Wenlong Liang
Rui Zhou
Yang Ma
Bing Zhang
Songlin Li
Yijia Liao
Ping Kuang
LM&Ro3DVAI4CE
169
8
0
14 Aug 2025
OmniVTLA: Vision-Tactile-Language-Action Model with Semantic-Aligned Tactile Sensing
OmniVTLA: Vision-Tactile-Language-Action Model with Semantic-Aligned Tactile Sensing
Zhengxue Cheng
Yiqian Zhang
Wenkang Zhang
Haoyu Li
Keyu Wang
Li Song
H. Zhang
163
6
0
12 Aug 2025
GeoVLA: Empowering 3D Representations in Vision-Language-Action Models
GeoVLA: Empowering 3D Representations in Vision-Language-Action Models
Lin Sun
Bin Xie
Yingfei Liu
Hao Shi
Tiancai Wang
Jiale Cao
146
14
0
12 Aug 2025
Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
Liang Xu
Chengqun Yang
Zili Lin
Fei Xu
Yifan Liu
...
Yunhui Liu
Xin Jin
Manwen Liao
Wenjun Zeng
Xiaokang Yang
EgoV
245
1
0
06 Aug 2025
ActionSink: Toward Precise Robot Manipulation with Dynamic Integration of Action Flow
ActionSink: Toward Precise Robot Manipulation with Dynamic Integration of Action Flow
Shanshan Guo
Xiwen Liang
Junfan Lin
Yuzheng Zhuang
Guanbin Li
Xiaodan Liang
153
1
0
05 Aug 2025
H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation
H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation
Hongzhe Bi
Lingxuan Wu
Tianwei Lin
Hengkai Tan
Zhizhong Su
Hang Su
Jun-Jie Zhu
196
11
0
31 Jul 2025
Exploring the Link Between Bayesian Inference and Embodied Intelligence: Toward Open Physical-World Embodied AI Systems
Exploring the Link Between Bayesian Inference and Embodied Intelligence: Toward Open Physical-World Embodied AI Systems
Bin Liu
229
0
0
29 Jul 2025
Reconstructing 4D Spatial Intelligence: A Survey
Reconstructing 4D Spatial Intelligence: A Survey
Yukang Cao
Jiahao Lu
Z. Huang
Zhuowei Shen
Chengfeng Zhao
...
Z. Chen
Xin Li
Wenping Wang
Yuan Liu
Ziwei Liu
VGen
349
8
0
28 Jul 2025
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
Wenyao Zhang
Hongsi Liu
Zekun Qi
Yunnan Wang
X. Yu
...
He Wang
Dongbin Zhao
Li Yi
Wenjun Zeng
Xin Jin
VLM
218
44
0
06 Jul 2025
123
Next