Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2508.10333
Cited By
ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver
14 August 2025
Wenxuan Song
Ziyang Zhou
Han Zhao
Jiayi Chen
Pengxiang Ding
Haodong Yan
Yuxin Huang
Feilong Tang
Xuetao Zhang
Haoang Li
LM&Ro
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver"
10 / 10 papers shown
TouchFormer: A Robust Transformer-based Framework for Multimodal Material Perception
Kailin Lyu
Long Xiao
Jianing Zeng
Junhao Dong
Xuexin Liu
Zhuojun Zou
Haoyue Yang
Lin Shu
Jie Hao
68
0
0
24 Nov 2025
AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention
Lei Xiao
Jifeng Li
Juntao Gao
Feiyang Ye
Yan Jin
Jingjing Qian
Jing Zhang
Y. Wu
Xiaoyuan Yu
350
0
0
24 Nov 2025
Multi-speaker Attention Alignment for Multimodal Social Interaction
Liangyang Ouyang
Yifei Huang
Mingfang Zhang
Caixin Kang
Ryosuke Furuta
Yoichi Sato
128
0
0
22 Nov 2025
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process
Jiayi Chen
Wenxuan Song
Pengxiang Ding
Ziyang Zhou
Han Zhao
Feilong Tang
Donglin Wang
Haoang Li
154
3
0
03 Nov 2025
A Survey on Efficient Vision-Language-Action Models
Zhaoshu Yu
Bo Wang
Pengpeng Zeng
Haonan Zhang
Ji Zhang
Lianli Gao
Jingkuan Song
Nicu Sebe
Heng Tao Shen
Heng Tao Shen
LM&Ro
219
7
0
27 Oct 2025
QDepth-VLA: Quantized Depth Prediction as Auxiliary Supervision for Vision-Language-Action Models
Y. Li
Yihao Chen
Mingcai Zhou
Haoran Li
Zhengtao Zhang
Dongbin Zhao
VLM
132
1
0
16 Oct 2025
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model
Fuhao Li
Wenxuan Song
Han Zhao
Jingbo Wang
Pengxiang Ding
Donglin Wang
Long Zeng
Haoang Li
217
7
0
14 Oct 2025
NoTVLA: Narrowing of Dense Action Trajectories for Generalizable Robot Manipulation
Zheng Huang
Mingyu Liu
Xiaoyi Lin
Huanyi Zheng
Canyu Zhao
...
Xiaoman Li
Yiduo Jia
Hao Zhong
Hao Chen
Chunhua Shen
118
1
0
04 Oct 2025
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
Yihao Wang
Pengxiang Ding
Lingxiao Li
Can Cui
Zirui Ge
...
Yifan Tang
Wenhui Wang
Ru Zhang
Jianyi Liu
Donglin Wang
272
29
0
11 Sep 2025
FlowVLA: Visual Chain of Thought-based Motion Reasoning for Vision-Language-Action Models
Zhide Zhong
Haodong Yan
Junfeng Li
Xiangchen Liu
Xin Gong
...
Wenxuan Song
Jiayi Chen
Xinhu Zheng
Hesheng Wang
Haoang Li
LRM
VGen
234
3
0
25 Aug 2025
1
Page 1 of 1