Neighbor communities
0 / 0 papers shown
Top Contributors
| Name | # Papers | # Citations |
|---|---|---|
Social Events
| Date | Location | Event |
|---|---|---|
| Name | # Papers | # Citations |
|---|---|---|
| Date | Location | Event |
|---|---|---|
Investigates how language models can be integrated into robotic systems to enhance interaction, command interpretation, and autonomous decision-making.
The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents Ziyu Wang Chenyuan Liu Yushun Xiang Runhao Zhang Qingbo Hao ...Mingyu Zhang Kecheng Zheng Qian Zhu Ran Cheng Yong-Lu Li | |||
ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models Linqing Zhong Yi Liu Yifei Wei Ziyu Xiong Maoqing Yao Si Liu Guanghui Ren | |||
Generative AI collective behavior needs an interactionist paradigm Laura Ferrarotti Gian Maria Campedelli Roberto Dessì Andrea Baronchelli Giovanni Iacca Kathleen M. Carley Alex Pentland Joel Z. Leibo James Evans Bruno Lepri | |||
VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory Shaoan Wang Yuanfei Luo Xingyu Chen Aocheng Luo Dongyue Li Chang Liu Sheng Chen Yangang Zhang Junzhi Yu | |||
Real2Sim based on Active Perception with automatically VLM-generated Behavior Trees Alessandro Adami Sebastian Zudaire Ruggero Carli Pietro Falco | |||
The Semantic Lifecycle in Embodied AI: Acquisition, Representation and Storage via Foundation Models Shuai Chen Hao Chen Yuanchen Bei Tianyang Zhao Zhibo Zhou Feiran Huang | |||
ShowUI-Aloha: Human-Taught GUI Agent Yichun Zhang Xiangwu Guo Yauhong Goh Jessica Hu Zhiheng Chen Xin Wang Difei Gao Mike Zheng Shou | |||
Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration Sen Wang Bangwei Liu Zhenkun Gao Lizhuang Ma Xuhong Wang Yuan Xie Xin Tan | |||
Agentic AI Empowered Intent-Based Networking for 6G Genze Jiang Kezhi Wang Xiaomin Chen Yizhou Huang | |||
SceneFoundry: Generating Interactive Infinite 3D Worlds ChunTeng Chen YiChen Hsu YiWen Liu WeiFang Sun TsaiChing Ni ChunYi Lee Min Sun YuanFu Yang | |||
LaST: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model Zhuoyang Liu Jiaming Liu Hao Chen Ziyu Guo Chengkai Hou ...Renrui Zhang Zhengping Che Jian Tang Pheng-Ann Heng Shanghang Zhang | |||
RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation Boyang Wang Haoran Zhang Shujie Zhang Jinkun Hao Mingda Jia ...Yucheng Mao Zhaoyang Lyu Jia Zeng Xudong Xu Jiangmiao Pang | |||
SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning Yanchang Liang Xiaowei Zhao | |||
LinguaGame: A Linguistically Grounded Game-Theoretic Paradigm for Multi-Agent Dialogue Generation Yuxiao Ye Yiming Zhang Yiran Ma Huiyuan Xie Huining Zhu Zhiyuan Liu | |||
SeqWalker: Sequential-Horizon Vision-and-Language Navigation with Hierarchical Planning Zebin Han Xudong Wang Baichen Liu Qi Lyu Zhenduo Shang Jiahua Dong Lianqing Liu Zhi Han | |||
Intent at a Glance: Gaze-Guided Robotic Manipulation via Foundation Models Tracey Yee Hsin Tay Xu Yan Jonathan Ouyang Daniel Wu William Jiang Jonathan Kao Yuchen Cui | |||
Stable Language Guidance for Vision-Language-Action Models Zhihao Zhan Yuhao Chen Jiaying Zhou Qinhan Lv Hao Liu Keze Wang Liang Lin Guangrun Wang | |||
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation Wenlong Huang Yu-Wei Chao Arsalan Mousavian Ming-Yu Liu Dieter Fox Kaichun Mo Li Fei-Fei | |||
The Path Ahead for Agentic AI: Challenges and Opportunities Nadia Sibai Yara Ahmed Serry Sibaee Sawsan AlHalawani Adel Ammar Wadii Boulila | |||
ChemBART: A Pre-trained BART Model Assisting Organic Chemistry Analysis Kenan Li Yijian Zhang Jin Wang Haipeng Gan Zeying Sun Xiaoguang Lei Hao Dong | |||
Genie Sim 3.0 : A High-Fidelity Comprehensive Simulation Platform for Humanoid Robot Chenghao Yin Da Huang Di Yang Jichao Wang Nanshu Zhao ...Rui Feng Zhenquan Pang Jiayu Li Qian Wang Maoqing Yao | |||
Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes Jing Tan Zhaoyang Zhang Yantao Shen Jiarui Cai Shuo Yang Jiajun Wu Wei Xia Zhuowen Tu Stefano Soatto | |||
Agentic AI in Remote Sensing: Foundations, Taxonomy, and Emerging Systems Niloufar Alipour Talemi Julia Boone Fatemeh Afghah | |||
SAGE-32B: Agentic Reasoning via Iterative Distillation Basab Jha Firoj Paudel Ujjwal Puri Ethan Henkel Zhang Yuting Mateusz Kowalczyk Mei Huang Choi Donghyuk Wang Junhao | |||
NitroGen: An Open Foundation Model for Generalist Gaming Agents Loïc Magne Anas Awadalla Guanzhi Wang Yinzhen Xu Joshua Belofsky ...Jan Kautz Yisong Yue Yejin Choi Yuke Zhu Linxi "Jim" Fan | |||
AMAP Agentic Planning Technical Report AMAP AI Agent Team Yulan Hu Xiangwen Zhang Sheng Ouyang Hao Yi ...Yinfeng Huang Ning Wang Tucheng Lin Xin Li Ning Guo | |||
VLN-MME: Diagnosing MLLMs as Language-guided Visual Navigation agents Xunyi Zhao Gengze Zhou Qi Wu | |||
RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence Chengkai Hou Kun Wu Jiaming Liu Zhengping Che Di Wu ...Junjie Ji Haonan Liu Kuan Cheng Shanghang Zhang Jian Tang | |||
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Junru Lu Jiarui Qin Lingfeng Qiao Yinghui Li Xinyi Dai ...Daohai Yu Jiahao Li Ke Li Zongyi Li Xiaoyu Tan | |||
Agentic Physical AI toward a Domain-Specific Foundation Model for Nuclear Reactor Control Yoonpyo Lee Kazuma Kobayashi Sai Puppala Sajedul Talukder Seid Koric Souvik Chakraborty Syed Bahauddin Alam | |||
Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives Shuanghao Bai Wenxuan Song Jiayi Chen Yuheng Ji Zhide Zhong ...Xiaolong Zheng Donglin Wang Haoang Li Shanghang Zhang Badong Chen | |||
Monadic Context Engineering Yifan Zhang Yang Yuan Mengdi Wang Andrew Chi-Chih Yao | |||
Emergence of Human to Robot Transfer in Vision-Language-Action Models Simar Kareer Karl Pertsch James Darpinian Judy Hoffman Danfei Xu Sergey Levine Chelsea Finn Suraj Nair | |||
Clutter-Resistant Vision-Language-Action Models through Object-Centric and Geometry Grounding Khoa Vo Taisei Hanyu Yuki Ikebe Trong Thang Pham Nhat Chung ...Duy Nguyen Ho Minh Anh Nguyen Anthony Gunderman Chase Rainwater Ngan Le | |||
VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs Wensi Huang Shaohao Zhu Meng Wei Jinming Xu Xihui Liu Hanqing Wang Tai Wang Feng Zhao Jiangmiao Pang | |||
Break Out the Silverware -- Semantic Understanding of Stored Household Items Michaela Levi-Richter Reuth Mirsky Oren Glickman | |||
HELP: Hierarchical Embodied Language Planner for Household Tasks Alexandr V. Korchemnyi Anatoly O. Onishchenko Eva A. Bakaeva Alexey K. Kovalev Aleksandr I. Panov | |||
LookPlanGraph: Embodied Instruction Following Method with VLM Graph Augmentation Anatoly O. Onishchenko Alexey K. Kovalev Aleksandr I. Panov | |||
MaP-AVR: A Meta-Action Planner for Agents Leveraging Vision Language Models and Retrieval-Augmented Generation Zhenglong Guo Yiming Zhao Feng Jiang Heng Jin Zongbao Feng Jianbin Zhou Siyuan Xu | |||
Vision-Language-Policy Model for Dynamic Robot Task Planning Jin Wang Kim Tien Ly Jacques Cloete Nikos Tsagarakis Ioannis Havoutis | |||
REALM: A Real-to-Sim Validated Benchmark for Generalization in Robotic Manipulation Martin Sedlacek Pavlo Yefanov Georgy Ponimatkin Jai Bardhan Simon Pilc Mederic Fourmy Evangelos Kazakos Cees G. M. Snoek Josef Sivic Vladimir Petrik | |||
Point What You Mean: Visually Grounded Instruction Policy Hang Yu Juntu Zhao Yufeng Liu Kaiyu Li Cheng Ma ...Guang Chen Junyuan Xie Junliang Guo Junqiao Zhao Yang Gao | |||
VLNVerse: A Benchmark for Vision-Language Navigation with Versatile, Embodied, Realistic Simulation and Evaluation Sihao Lin Zerui Li Xunyi Zhao Gengze Zhou Liuyi Wang ...Hanqing Wang Jiangmiao Pang Anton van den Hengel Jiajun Liu Qi Wu | |||
| Name (-) |
|---|
| Name (-) |
|---|
| Name (-) |
|---|
| Date | Location | Event | |
|---|---|---|---|
| No social events available | |||