Neighbor communities
0 / 0 papers shown
Title |
|---|
Top Contributors
| Name | # Papers | # Citations |
|---|---|---|
Social Events
| Date | Location | Event |
|---|---|---|
Title |
|---|
| Name | # Papers | # Citations |
|---|---|---|
| Date | Location | Event |
|---|---|---|
Investigates how language models can be integrated into robotic systems to enhance interaction, command interpretation, and autonomous decision-making.
Title |
|---|
Title | |||
|---|---|---|---|
LaST: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model Zhuoyang Liu Jiaming Liu Hao Chen Ziyu Guo Chengkai Hou ...Renrui Zhang Zhengping Che Jian Tang Pheng-Ann Heng Shanghang Zhang | |||
![]() ImagineNav++: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination Teng Wang Xinxin Zhao Wenzhe Cai Changyin Sun | |||
![]() Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes Jing Tan Zhaoyang Zhang Yantao Shen Jiarui Cai Shuo Yang Jiajun Wu Wei Xia Zhuowen Tu Stefano Soatto | |||
LinguaGame: A Linguistically Grounded Game-Theoretic Paradigm for Multi-Agent Dialogue Generation Yuxiao Ye Yiming Zhang Yiran Ma Huiyuan Xie Huining Zhu Zhiyuan Liu | |||
SeqWalker: Sequential-Horizon Vision-and-Language Navigation with Hierarchical Planning Zebin Han Xudong Wang Baichen Liu Qi Lyu Zhenduo Shang Jiahua Dong Lianqing Liu Zhi Han | |||
RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation Boyang Wang Haoran Zhang Shujie Zhang Jinkun Hao Mingda Jia ...Yucheng Mao Zhaoyang Lyu Jia Zeng Xudong Xu Jiangmiao Pang | |||
SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning Yanchang Liang Xiaowei Zhao | |||
![]() PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation Wenlong Huang Yu-Wei Chao Arsalan Mousavian Ming-Yu Liu Dieter Fox Kaichun Mo Li Fei-Fei | |||
![]() Stable Language Guidance for Vision-Language-Action Models Zhihao Zhan Yuhao Chen Jiaying Zhou Qinhan Lv Hao Liu Keze Wang Liang Lin Guangrun Wang | |||
![]() The Path Ahead for Agentic AI: Challenges and Opportunities Nadia Sibai Yara Ahmed Serry Sibaee Sawsan AlHalawani Adel Ammar Wadii Boulila | |||
![]() ChemBART: A Pre-trained BART Model Assisting Organic Chemistry Analysis Kenan Li Yijian Zhang Jin Wang Haipeng Gan Zeying Sun Xiaoguang Lei Hao Dong | |||
![]() Genie Sim 3.0 : A High-Fidelity Comprehensive Simulation Platform for Humanoid Robot Chenghao Yin Da Huang Di Yang Jichao Wang Nanshu Zhao ...Rui Feng Zhenquan Pang Jiayu Li Qian Wang Maoqing Yao | |||
![]() Agentic AI in Remote Sensing: Foundations, Taxonomy, and Emerging Systems Niloufar Alipour Talemi Julia Boone Fatemeh Afghah | |||
SAGE-32B: Agentic Reasoning via Iterative Distillation Basab Jha Firoj Paudel Ujjwal Puri Ethan Henkel Zhang Yuting Mateusz Kowalczyk Mei Huang Choi Donghyuk Wang Junhao | |||
![]() NitroGen: An Open Foundation Model for Generalist Gaming Agents Loïc Magne Anas Awadalla Guanzhi Wang Yinzhen Xu Joshua Belofsky ...Jan Kautz Yisong Yue Yejin Choi Yuke Zhu Linxi "Jim" Fan | |||
![]() Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Junru Lu Jiarui Qin Lingfeng Qiao Yinghui Li Xinyi Dai ...Daohai Yu Jiahao Li Ke Li Zongyi Li Xiaoyu Tan | |||
![]() RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence Chengkai Hou Kun Wu Jiaming Liu Zhengping Che Di Wu ...Junjie Ji Haonan Liu Kuan Cheng Shanghang Zhang Jian Tang | |||
![]() VLN-MME: Diagnosing MLLMs as Language-guided Visual Navigation agents Xunyi Zhao Gengze Zhou Qi Wu | |||
![]() AMAP Agentic Planning Technical Report AMAP AI Agent Team Yulan Hu Xiangwen Zhang Sheng Ouyang Hao Yi ...Yinfeng Huang Ning Wang Tucheng Lin Xin Li Ning Guo | |||
![]() Agentic Physical AI toward a Domain-Specific Foundation Model for Nuclear Reactor Control Yoonpyo Lee Kazuma Kobayashi Sai Puppala Sajedul Talukder Seid Koric Souvik Chakraborty Syed Bahauddin Alam | |||
![]() Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives Shuanghao Bai Wenxuan Song Jiayi Chen Yuheng Ji Zhide Zhong ...Xiaolong Zheng Donglin Wang Haoang Li Shanghang Zhang Badong Chen | |||
![]() Clutter-Resistant Vision-Language-Action Models through Object-Centric and Geometry Grounding Khoa Vo Taisei Hanyu Yuki Ikebe Trong Thang Pham Nhat Chung ...Duy Nguyen Ho Minh Anh Nguyen Anthony Gunderman Chase Rainwater Ngan Le | |||
![]() Emergence of Human to Robot Transfer in Vision-Language-Action Models Simar Kareer Karl Pertsch James Darpinian Judy Hoffman Danfei Xu Sergey Levine Chelsea Finn Suraj Nair | |||
![]() Monadic Context Engineering Yifan Zhang Yang Yuan Mengdi Wang Andrew Chi-Chih Yao | |||
![]() VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs Wensi Huang Shaohao Zhu Meng Wei Jinming Xu Xihui Liu Hanqing Wang Tai Wang Feng Zhao Jiangmiao Pang | |||
![]() Break Out the Silverware -- Semantic Understanding of Stored Household Items Michaela Levi-Richter Reuth Mirsky Oren Glickman | |||
![]() HELP: Hierarchical Embodied Language Planner for Household Tasks Alexandr V. Korchemnyi Anatoly O. Onishchenko Eva A. Bakaeva Alexey K. Kovalev Aleksandr I. Panov | |||
![]() LookPlanGraph: Embodied Instruction Following Method with VLM Graph Augmentation Anatoly O. Onishchenko Alexey K. Kovalev Aleksandr I. Panov | |||
![]() Affordance RAG: Hierarchical Multimodal Retrieval with Affordance-Aware Embodied Memory for Mobile Manipulation Ryosuke Korekata Quanting Xie Yonatan Bisk Komei Sugiura | |||
![]() VLNVerse: A Benchmark for Vision-Language Navigation with Versatile, Embodied, Realistic Simulation and Evaluation Sihao Lin Zerui Li Xunyi Zhao Gengze Zhou Liuyi Wang ...Hanqing Wang Jiangmiao Pang Anton van den Hengel Jiajun Liu Qi Wu | |||
![]() REALM: A Real-to-Sim Validated Benchmark for Generalization in Robotic Manipulation Martin Sedlacek Pavlo Yefanov Georgy Ponimatkin Jai Bardhan Simon Pilc Mederic Fourmy Evangelos Kazakos Cees G. M. Snoek Josef Sivic Vladimir Petrik | |||
![]() MaP-AVR: A Meta-Action Planner for Agents Leveraging Vision Language Models and Retrieval-Augmented Generation Zhenglong Guo Yiming Zhao Feng Jiang Heng Jin Zongbao Feng Jianbin Zhou Siyuan Xu | |||
![]() Point What You Mean: Visually Grounded Instruction Policy Hang Yu Juntu Zhao Yufeng Liu Kaiyu Li Cheng Ma ...Guang Chen Junyuan Xie Junliang Guo Junqiao Zhao Yang Gao | |||
![]() Vision-Language-Policy Model for Dynamic Robot Task Planning Jin Wang Kim Tien Ly Jacques Cloete Nikos Tsagarakis Ioannis Havoutis | |||
![]() Emergent Persuasion: Will LLMs Persuade Without Being Prompted? Vincent Chang Thee Ho Sunishchal Dev Kevin Zhu Shi Feng Kellin Pelrine Matthew Kowal | |||
![]() Neuro-Symbolic Control with Large Language Models for Language-Guided Spatial Tasks Momina Liaqat Ali Muhammad Abid | |||
![]() RecipeMasterLLM: Revisiting RoboEarth in the Era of Large Language Models Asil Kaan Bozcuoglu Ziyuan Liu | |||
![]() Embodied4C: Measuring What Matters for Embodied Vision-Language Navigation Tin Stribor Sohn Maximilian Dillitzer Jason J. Corso Eric Sax | |||
![]() LangDriveCTRL: Natural Language Controllable Driving Scene Editing with Multi-modal Agents Yun He Francesco Pittaluga Ziyu Jiang Matthias Zwicker Manmohan Chandraker Zaid Tasneem | |||
![]() Lang2Manip: A Tool for LLM-Based Symbolic-to-Geometric Planning for Manipulation Muhayy Ud Din Jan Rosell Waseem Akram Irfan Hussain | |||
![]() PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence Xiaopeng Lin Shijie Lian Bin Yu Ruoqi Yang Changti Wu ...Yurun Jin Yukun Shi Cong Huang Bojun Cheng Kai Chen | |||
![]() MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Yuanchen Ju Yongyuan Liang Yen-Jen Wang Nandiraju Gireesh Yuanliang Ju Seungjae Lee Qiao Gu Elvis Hsieh Furong Huang Koushil Sreenath | |||
![]() City Navigation in the Wild: Exploring Emergent Navigation from Web-Scale Knowledge in MLLMs Dwip Dalal Utkarsh Mishra Narendra Ahuja Nebojsa Jojic | |||
![]() Large Video Planner Enables Generalizable Robot Control Boyuan Chen Tianyuan Zhang Haoran Geng Kiwhan Song Caiyi Zhang ...Jitendra Malik Pieter Abbeel Russ Tedrake Vincent Sitzmann Yilun Du | |||
![]() mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs Jonas Pai Liam Achenbach Victoriano Montesinos Benedek Forrai Oier Mees Elvis Nava | |||
![]() MiVLA: Towards Generalizable Vision-Language-Action Model with Human-Robot Mutual Imitation Pre-training Zhenhan Yin Xuanhan Wang Jiahao Jiang Kaiyuan Deng Pengqi Chen ...Chong Liu Xing Xu Jingkuan Song Lianli Gao Heng Tao Shen | |||
| Name (-) |
|---|
| Name (-) |
|---|
| Name (-) |
|---|
| Date | Location | Event | |
|---|---|---|---|
| No social events available | |||