Neighbor communities
0 / 0 papers shown
Top Contributors
| Name | # Papers | # Citations |
|---|---|---|
Social Events
| Date | Location | Event |
|---|---|---|
| Name | # Papers | # Citations |
|---|---|---|
| Date | Location | Event |
|---|---|---|
Investigates how language models can be integrated into robotic systems to enhance interaction, command interpretation, and autonomous decision-making.
SimVLA: A Simple VLA Baseline for Robotic Manipulation Yuankai Luo Woping Chen Tong Liang Baiqiao Wang Zhenguo Li | |||
CapNav: Benchmarking Vision Language Models on Capability-conditioned Indoor Navigation Xia Su Ruiqi Chen Benlin Liu Jingwei Ma Zonglin Di Ranjay Krishna Jon Froehlich | |||
Modeling Distinct Human Interaction in Web Agents Faria Huq Zora Zhiruo Wang Zhanqiu Guo Venu Arvind Arangarajan Tianyue Ou Frank Xu Shuyan Zhou Graham Neubig Jeffrey P. Bigham | |||
MALLVI: A Multi-Agent Framework for Integrated Generalized Robotics Manipulation Iman Ahmadi Mehrshad Taji Arad Mahdinezhad Kashani AmirHossein Jadidi Saina Kashani Babak Khalaj | |||
World Action Models are Zero-shot Policies Seonghyeon Ye Yunhao Ge Kaiyuan Zheng Shenyuan Gao Sihyun Yu ...Scott Reed Jan Kautz Yuke Zhu Linxi "Jim" Fan Joel Jang | |||
One Agent to Guide Them All: Empowering MLLMs for Vision-and-Language Navigation via Explicit World Representation Zerui Li Hongpei Zheng Fangguo Zhao Aidan Chan Jian Zhou Sihao Lin Shijie Li Qi Wu | |||
DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI En Yu Haoran Lv Jianjian Sun Kangheng Lin Ruitao Zhang ...Wenbin Tang Xiangyu Zhang Zheng Ge Erjin Zhou Tiancai Wang | |||
Plan-MCTS: Plan Exploration for Action Exploitation in Web Navigation Weiming Zhang Jihong Wang Jiamu Zhou Qingyao Li Xinbei Ma ...Weiwen Liu Zhuosheng Zhang Jun Wang Yong Yu Weinan Zhang | |||
Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents Haiyang Xu Xi Zhang Haowei Liu Junyang Wang Zhaozai Zhu ...Ze Xu Shuai Bai Junyang Lin Jingren Zhou Ming Yan | |||
Ontological grounding for sound and natural robot explanations via large language models Alberto Olivares-Alarcos Muhammad Ahsan Satrio Sanjaya Hsien-I Lin Guillem Alenyà | |||
AgentRob: From Virtual Forum Agents to Hijacked Physical Robots Wenrui Liu Yaxuan Wang Xun Zhang Yanshu Wang Jiashen Wei ...Xinyang Chen Hengzhe Sun Jiyu Shen Jingjing He Tong Yang | |||
UniManip: General-Purpose Zero-Shot Robotic Manipulation with Agentic Operational Graph Haichao Liu Yuanjiang Xue Yuheng Zhou Haoyuan Deng Yinan Liang Lihua Xie Ziwei Wang | |||
Steerable Vision-Language-Action Policies for Embodied Reasoning and Hierarchical Control William Chen Jagdeep Singh Bhatia Catherine Glossop Nikhil Mathihalli Ria Doshi Andy Tang Danny Driess Karl Pertsch Sergey Levine | |||
How Do We Research Human-Robot Interaction in the Age of Large Language Models? A Systematic Review Yufeng Wang Yuan Xu Anastasia Nikolova Yuxuan Wang Jianyu Wang Chongyang Wang Xin Tong | |||
RynnBrain: Open Embodied Foundation Models Ronghao Dang Jiayan Guo Bohan Hou Sicong Leng Kehan Li ...Wenqiao Zhang Chengju Liu Jianfei Yang Shijian Lu Deli Zhao | |||
Agentic AI for Robot Control: Flexible but still Fragile Oscar Lima Marc Vinci Martin Günther Marian Renz Alexander Sung ...Zongyao Yi Felix Igelbrink Benjamin Kisliuk Martin Atzmueller Joachim Hertzberg | |||
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution Rui Cai Jun Guo Xinze He Piaopiao Jin Jie Li ...Diyun Xiang Yu Yang Hangjun Ye Yuan Zhang Quanyun Zhou | |||
Scaling Single Human Demonstrations for Imitation Learning using Generative Foundational Models Nick Heppert Minh Quang Nguyen Abhinav Valada | |||
In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach Yiran Gao Kim Hammar Tao Li | |||
HoloBrain-0 Technical Report Xuewu Lin Tianwei Lin Yun Du Hongyu Xie Yiwei Jin ...Ziang Li Chaodong Huang Hongzhe Bi Lichao Huang Zhizhong Su | |||
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning GigaBrain Team Boyuan Wang Chaojun Ni Guan Huang Guosheng Zhao ...Yilong Li Yukun Zhou Yun Ye Zhichao Liu Zheng Zhu | |||
Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks Zhihong Liu Yang Li Rengming Huang Cewu Lu Panpan Cai | |||
3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting Wancai Zheng Hao Chen Xianlong Lu Linlin Ou Xinyi Yu | |||
ABot-N0: Technical Report on the VLA Foundation Model for Versatile Embodied Navigation Zedong Chu Shichao Xie Xiaolong Wu Yanfen Shen Minghua Luo ...Xiangpo Yang Menglin Yang Hongguang Xing Weiguo Li Mu Xu | |||
LAMP: Implicit Language Map for Robot NavigationIEEE Robotics and Automation Letters (IEEE RA-L), 2025 Sibaek Lee Hyeonwoo Yu Giseop Kim Sunwook Choi | |||
Budget-Constrained Agentic Large Language Models: Intention-Based Planning for Costly Tool Use Hanbing Liu Chunhao Tian Nan An Ziyuan Wang Pinyan Lu Changyuan Yu Qi Qi | |||
LocoVLM: Grounding Vision and Language for Adapting Versatile Legged Locomotion Policies I Made Aswin Nahrendra Seunghyun Lee Dongkyu Lee Hyun Myung | |||
MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation Yejin Kim Wilbert Pumacay Omar Rayyan Max Argus Winson Han ...Georgia Chalvatzaki Yuchen Cui Ali Farhadi Dieter Fox Ranjay Krishna | |||
LAP: Language-Action Pre-Training Enables Zero-shot Cross-Embodiment Transfer Lihan Zha Asher J. Hancock Mingtong Zhang Tenny Yin Yixuan Huang Dhruv Shah Allen Z. Ren Anirudha Majumdar | |||
Active Zero: Self-Evolving Vision-Language Models through Active Environment Exploration Jinghan He Junfeng Fang Feng Xiong Zijun Yao Fei Shen Haiyun Guo Jinqiao Wang Tat-Seng Chua | |||
Scaling World Model for Hierarchical Manipulation Policies Qian Long Yueze Wang Jiaxi Song Junbo Zhang Peiyan Li ...Xinlong Wang Zhongyuan Wang Xuguang Lan Huaping Liu Xinghang Li | |||
Say, Dream, and Act: Learning Video World Models for Instruction-Driven Robot Manipulation Songen Gu Yunuo Cai Tianyu Wang Simo Wu Yanwei Fu | |||
Discovering High Level Patterns from Simulation Traces Sean Memery Kartic Subr | |||
VideoAfford: Grounding 3D Affordance from Human-Object-Interaction Videos via Multimodal Large Language Model Hanqing Wang Mingyu Liu Xiaoyu Chen Chengwei MA Yiming Zhong ...Zhiqing Cui Jiahao Yuan Lu Dai Zhiyuan Ma Hui Xiong | |||
SAGE: Scalable Agentic 3D Scene Generation for Embodied AI Hongchi Xia Xuan Li Zhaoshuo Li Qianli Ma Jiashu Xu ...Tsung-Yi Lin Wei-Chiu Ma Shenlong Wang Shuran Song Fangyin Wei | |||
BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation Yucheng Hu Jianke Zhang Yuanfei Luo Yanjiang Guo Xiaoyu Chen ...Qingzhou Lu Sheng Chen Yangang Zhang Wei Li Jianyu Chen | |||
RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation Hao Li Ziqin Wang Zi-han Ding Shuai Yang Yilun Chen ...Tai Wang Dahua Lin Feng Zhao Si Liu Jiangmiao Pang | |||
Rethinking Visual-Language-Action Model Scaling: Alignment, Mixture, and Regularization Ye Wang Sipeng Zheng Hao Luo Wanpeng Zhang Haoqi Yuan ...Yicheng Feng Mingyang Yu Zhiyu Kang Zongqing Lu Qin Jin | |||
UniPlan: Vision-Language Task Planning for Mobile Manipulation with Unified PDDL Formulation Haoming Ye Yunxiao Xiao Cewu Lu Panpan Cai | |||
SceneSmith: Agentic Generation of Simulation-Ready Indoor Scenes Nicholas Pfaff Thomas Cohn Sergey Zakharov Rick Cory Russ Tedrake | |||
UI-Venus-1.5 Technical Report Veuns-Team Changlong Gao Zhangxuan Gu Yulin Liu Xinyu Qiu ...Linchao Zhu Liang Chen Zhenyu Guo Changhua Meng Weiqiang Wang | |||
VISOR: VIsual Spatial Object Reasoning for Language-driven Object Navigation Francesco Taioli Shiping Yang Sonia Raychaudhuri Marco Cristani Unnat Jain Angel X Chang | |||
Action Hallucination in Generative Visual-Language-Action Models Harold Soh Eugene Lim | |||
VLN-Pilot: Large Vision-Language Model as an Autonomous Indoor Drone Operator Bessie Dominguez-Dager Sergio Suescun-Ferrandiz Felix Escalona Francisco Gomez-Donoso Miguel Cazorla | |||
MobileManiBench: Simplifying Model Verification for Mobile Manipulation Wenbo Wang Fangyun Wei QiXiu Li Xi Chen Yaobo Liang Chang Xu Jiaolong Yang Baining Guo | |||
OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions Fangzhi Xu Hang Yan Qiushi Sun Jinyang Wu Zixian Huang ...Haoran Luo Xuanjing Huang Ben Kao Jun Liu Qika Lin | |||
Graph-based Agent Memory: Taxonomy, Techniques, and Applications Chang Yang Chuang Zhou Yilin Xiao Su Dong Luyao Zhuang ...Ninghao Liu Jinsong Su Xinrun Wang Yi Chang Xiao Huang | |||
Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration Jiaheng Liu Yuanxing Zhang Shihao Li Xinping Lei | |||
| Name (-) |
|---|
| Name (-) |
|---|
| Name (-) |
|---|
| Date | Location | Event | |
|---|---|---|---|
| No social events available | |||