1904.03461
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
6 April 2019
Erik Wijmans
Samyak Datta
Oleksandr Maksymets
Abhishek Das
Georgia Gkioxari
Stefan Lee
Irfan Essa
Devi Parikh
Dhruv Batra
3DPC
LM&Ro
Papers citing
"Embodied Question Answering in Photorealistic Environments with Point Cloud Perception"
50 / 114 papers shown
When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering
Tao Wu
Chuhao Zhou
Guangyu Zhao
Haozhi Cao
Yewen Pu
J. Yang
375
0
0
04 Dec 2025
Vision to Geometry: 3D Spatial Memory for Sequential Embodied MLLM Reasoning and Exploration
Zhongyi Cai
Yi Du
Chen Wang
Yu Kong
LRM
154
1
0
02 Dec 2025
ReEXplore: Improving MLLMs for Embodied Exploration with Contextualized Retrospective Experience Replay
Gengyuan Zhang
Mingcong Ding
Jingpei Wu
Ruotong Liao
Volker Tresp
LRM
243
1
0
24 Nov 2025
Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy
Italian National Conference on Sensors (INS), 2025
Vinit Mehta
Charu Sharma
Karthick Thiyagarajan
LM&Ro
431
5
0
14 Nov 2025
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
Shouwei Ruan
Liyuan Wang
Caixin Kang
Qihui Zhu
Songming Liu
Xingxing Wei
Hang Su
LM&Ro
226
9
0
24 Aug 2025
Large VLM-based Vision-Language-Action Models for Robotic Manipulation: A Survey
Rui Shao
W. Li
Lingsen Zhang
Renshan Zhang
Zhiyang Liu
Ran Chen
Liqiang Nie
LM&Ro
392
52
0
18 Aug 2025
Recursive Visual Imagination and Adaptive Linguistic Grounding for Vision Language Navigation
Bolei Chen
Jiaxu Kang
Yifei Wang
Ping Zhong
Qi Wu
Jianxin Wang
LM&Ro
190
0
0
29 Jul 2025
Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering
M. Ginting
Dong-Ki Kim
Xiangyun Meng
Andrzej Reinke
Bandi Jai Krishna
...
Amirreza Shaban
Sung-Kyun Kim
Mykel J. Kochenderfer
Ali-Akbar Agha-Mohammadi
Shayegan Omidshafiei
RALM
359
7
0
17 Jul 2025
EQA-RM: A Generative Embodied Reward Model with Test-time Scaling
Yuhang Chen
Zhen Tan
Tianlong Chen
469
1
0
12 Jun 2025
Point-MoE: Large-Scale Multi-Dataset Training with Mixture-of-Experts for 3D Semantic Segmentation
Xuweiyi Chen
Wentao Zhou
Aruni RoyChowdhury
Zezhou Cheng
3DPC
430
2
0
29 May 2025
3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model
Wenbo Hu
Yining Hong
Yanjun Wang
Leison Gao
Zibu Wei
Xingcheng Yao
Nanyun Peng
Yonatan Bitton
Idan Szpektor
Kai-Wei Chang
365
17
0
28 May 2025
Visual Environment-Interactive Planning for Embodied Complex-Question Answering
Ning Lan
Baoshan Ou
Xuemei Xie
G. Shi
LM&Ro
371
2
0
01 Apr 2025
RoboTron-Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction
Yufeng Zhong
Chengjian Feng
Feng Yan
Fanfan Liu
Liming Zheng
Lin Ma
618
1
0
24 Mar 2025
MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation
P. Zhang
Xianqiang Gao
Yuhan Wu
Pengan Chen
Dong Wang
Zechuan Wang
Jiangwei Zhong
Yan Ding
Xiaochen Li
LM&Ro
349
15
0
14 Mar 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Kaixuan Jiang
Wenshu Fan
Weixing Chen
Jingzhou Luo
Ziliang Chen
Ling Pan
G. Li
Guanbin Li
514
19
0
14 Mar 2025
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
Dongping Li
Tielong Cai
Tianci Tang
Wenhao Chai
Katherine Rose Driggs-Campbell
Gaoang Wang
LM&Ro
674
2
0
11 Mar 2025
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans
Dillon Loh
Tomasz Bednarz
Xinxing Xia
Frank Guan
493
1
0
27 Nov 2024
LidaRefer: Context-aware Outdoor 3D Visual Grounding for Autonomous Driving
Yeong-Seung Baek
Heung-Seon Oh
375
0
0
07 Nov 2024
EfficientEQA: An Efficient Approach to Open-Vocabulary Embodied Question Answering
Kai Cheng
Zhengyuan Li
Xingpeng Sun
Byung-Cheol Min
Amrit Singh Bedi
Aniket Bera
225
10
0
26 Oct 2024
Mars: Situated Inductive Reasoning in an Open-World Environment
Neural Information Processing Systems (NeurIPS), 2024
Xiaojuan Tang
Jiaqi Li
Yitao Liang
Song-chun Zhu
Muhan Zhang
Zilong Zheng
LM&Ro
LRM
LLMAG
455
10
0
10 Oct 2024
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models
International Conference on Learning Representations (ICLR), 2024
Yue Zhang
Zhiyang Xu
Ying Shen
Parisa Kordjamshidi
Lifu Huang
396
23
0
04 Oct 2024
ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Abrar Anwar
John Welsh
Joydeep Biswas
Soha Pouya
Yan Chang
LM&Ro
241
58
0
20 Sep 2024
Answerability Fields: Answerable Location Estimation via Diffusion Models
Daichi Azuma
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
M. Kawanabe
DiffM
269
0
0
26 Jul 2024
3D Question Answering for City Scene Understanding
Yixiang Chen
Yaoxian Song
Xiang Liu
Xiaofei Yang
Qiang-qiang Wang
Tiefeng Li
Yang Yang
Xiaowen Chu
243
12
0
24 Jul 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Zehua Wang
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&Ro
SyDa
AI4CE
800
241
0
09 Jul 2024
CityNav: A Large-Scale Dataset for Real-World Aerial Navigation
Jungdae Lee
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
Daichi Azuma
Yutaka Matsuo
Nakamasa Inoue
405
23
0
20 Jun 2024
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Wufei Ma
Guanning Zeng
Guofeng Zhang
Qihao Liu
Letian Zhang
Adam Kortylewski
Yaoyao Liu
Alan Yuille
VLM
3DV
303
17
0
13 Jun 2024
Evaluating Zero-Shot GPT-4V Performance on 3D Visual Question Answering Benchmarks
Simranjit Singh
Georgios Pavlakos
Dimitrios Stamoulis
343
11
0
29 May 2024
Map-based Modular Approach for Zero-shot Embodied Question Answering
Koya Sakamoto
Daichi Azuma
Taiki Miyanishi
Shuhei Kurita
M. Kawanabe
388
7
0
26 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
1.4K
215
0
23 May 2024
Think-Program-reCtify: 3D Situated Reasoning with Large Language Models
Qingrong He
Kejun Lin
Shizhe Chen
Anwen Hu
Qin Jin
LRM
277
4
0
23 Apr 2024
Following the Human Thread in Social Navigation
Luca Scofano
Alessio Sampieri
Tommaso Campari
Valentino Sacco
Indro Spinelli
Lamberto Ballan
Yuta Kyuragi
524
4
0
17 Apr 2024
Explore until Confident: Efficient Exploration for Embodied Question Answering
Allen Z. Ren
Jaden Clark
Anushri Dixit
Masha Itkina
Anirudha Majumdar
Dorsa Sadigh
477
79
0
23 Mar 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Haiwei Yang
Ruyue Yuan
LM&Ro
498
8
0
22 Feb 2024
Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning?
Gunshi Gupta
Tim G. J. Rudner
R. McAllister
Adrien Gaidon
Y. Gal
OffRL
262
4
0
28 Dec 2023
Towards Learning a Generalist Model for Embodied Navigation
Computer Vision and Pattern Recognition (CVPR), 2023
Duo Zheng
Shijia Huang
Lin Zhao
Yiwu Zhong
Liwei Wang
LM&Ro
785
145
0
04 Dec 2023
Exploitation-Guided Exploration for Semantic Embodied Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2023
Justin Wasserman
Girish Chowdhary
Abhinav Gupta
Unnat Jain
301
14
0
06 Nov 2023
Active Reasoning in an Open-World Environment
Neural Information Processing Systems (NeurIPS), 2023
Manjie Xu
Guangyuan Jiang
Weihan Liang
Fangqiu Yi
Yixin Zhu
LLMAG
LRM
351
16
0
03 Nov 2023
CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding
International Conference on Learning Representations (ICLR), 2023
Eslam Mohamed Bakr
Mohamed Ayman
Mahmoud Ahmed
Habib Slim
Mohamed Elhoseiny
LRM
453
16
0
10 Oct 2023
Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving
IEEE International Conference on Robotics and Automation (ICRA), 2023
Tushar Choudhary
Vikrant Dewangan
Shivam Chandhok
Shubham Priyadarshan
Anushka Jain
A. K. Singh
Siddharth Srivastava
Krishna Murthy Jatavallabhula
K. M. Krishna
378
122
0
03 Oct 2023
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Ziyu Guo
Renrui Zhang
Xiangyang Zhu
Yiwen Tang
Xianzheng Ma
...
Ke Chen
Shiyang Feng
Xianzhi Li
Jiaming Song
Pheng-Ann Heng
MLLM
451
211
0
01 Sep 2023
Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language
Francesco Taioli
Federico Cunico
Federico Girella
Riccardo Bologna
Alessandro Farinelli
Marco Cristani
227
10
0
17 Aug 2023
An Outlook into the Future of Egocentric Vision
International Journal of Computer Vision (IJCV), 2023
Chiara Plizzari
Gabriele Goletto
Antonino Furnari
Siddhant Bansal
Francesco Ragusa
G. Farinella
Dima Damen
Tatiana Tommasi
EgoV
345
85
0
14 Aug 2023
Object Goal Navigation with Recursive Implicit Maps
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Shizhe Chen
Thomas Chabal
Ivan Laptev
Cordelia Schmid
288
37
0
10 Aug 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Ruitao Liu
Xiaohan Wang
Wenguan Wang
Yi Yang
387
100
0
09 Aug 2023
Heterogeneous Embodied Multi-Agent Collaboration
IEEE Robotics and Automation Letters (RA-L), 2023
Xinzhu Liu
Di Guo
Huaping Liu
434
16
0
26 Jul 2023
TriVol: Point Cloud Rendering via Triple Volumes
Computer Vision and Pattern Recognition (CVPR), 2023
T. Hu
Xiaohan Li
Ruihang Chu
Jiaya Jia
3DPC
229
21
0
29 Mar 2023
360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Zhifeng Teng
Kailai Li
Kailun Yang
Kunyu Peng
Haowen Shi
Simon Reiß
Ke Cao
Rainer Stiefelhagen
MDE
303
28
0
21 Mar 2023
OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav
Karmesh Yadav
Arjun Majumdar
Ram Ramrakhya
Naoki Yokoyama
Alexei Baevski
Z. Kira
Oleksandr Maksymets
Dhruv Batra
ViT
394
83
0
14 Mar 2023
MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation
Zongtao He
Liuyi Wang
Shu Li
Qingqing Yan
Chengju Liu
Qi Chen
294
13
0
02 Mar 2023