Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.03461
Cited By
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
6 April 2019
Erik Wijmans
Samyak Datta
Oleksandr Maksymets
Abhishek Das
Georgia Gkioxari
Stefan Lee
Irfan Essa
Devi Parikh
Dhruv Batra
3DPC
LM&Ro
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Embodied Question Answering in Photorealistic Environments with Point Cloud Perception"
33 / 33 papers shown
Title
Visual Environment-Interactive Planning for Embodied Complex-Question Answering
Ning Lan
Baoshan Ou
Xuemei Xie
G. Shi
LM&Ro
66
1
0
01 Apr 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Kaixuan Jiang
Y. Liu
Weixing Chen
Jingzhou Luo
Ziliang Chen
Ling Pan
G. Li
Liang Lin
51
2
0
14 Mar 2025
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
Dongping Li
Tielong Cai
Tianci Tang
Wenhao Chai
Katherine Rose Driggs-Campbell
Gaoang Wang
LM&Ro
61
0
0
11 Mar 2025
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models
Yue Zhang
Zhiyang Xu
Ying Shen
Parisa Kordjamshidi
Lifu Huang
32
6
0
04 Oct 2024
Answerability Fields: Answerable Location Estimation via Diffusion Models
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
M. Kawanabe
DiffM
48
0
0
26 Jul 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
71
41
0
23 May 2024
Following the Human Thread in Social Navigation
Luca Scofano
Alessio Sampieri
Tommaso Campari
Valentino Sacco
Indro Spinelli
Lamberto Ballan
Fabio Galasso
31
0
0
17 Apr 2024
An Outlook into the Future of Egocentric Vision
Chiara Plizzari
Gabriele Goletto
Antonino Furnari
Siddhant Bansal
Francesco Ragusa
G. Farinella
Dima Damen
Tatiana Tommasi
EgoV
32
38
0
14 Aug 2023
MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation
Zongtao He
Liuyi Wang
Shu Li
Qingqing Yan
Chengju Liu
Qi Chen
19
7
0
02 Mar 2023
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio-Linguistic Models in 3D Scenes
Ahmed Abdelreheem
Kyle Olszewski
Hsin-Ying Lee
Peter Wonka
Panos Achlioptas
3DPC
20
28
0
12 Dec 2022
A General Purpose Supervisory Signal for Embodied Agents
Kunal Pratap Singh
Jordi Salvador
Luca Weihs
Aniruddha Kembhavi
SSL
21
3
0
01 Dec 2022
AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments
Sudipta Paul
A. Roy-Chowdhury
A. Cherian
25
23
0
14 Oct 2022
Learning a Visually Grounded Memory Assistant
Meera Hahn
Kevin Carlberg
Ruta Desai
James M. Hillis
17
1
0
07 Oct 2022
Iterative Vision-and-Language Navigation
Jacob Krantz
Shurjo Banerjee
Wang Zhu
Jason J. Corso
Peter Anderson
Stefan Lee
Jesse Thomason
LM&Ro
40
18
0
06 Oct 2022
Episodic Memory Question Answering
Samyak Datta
Sameer Dharur
Vincent Cartillier
Ruta Desai
Mukul Khanna
Dhruv Batra
Devi Parikh
EgoV
11
31
0
03 May 2022
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Ram Ramrakhya
Eric Undersander
Dhruv Batra
Abhishek Das
LM&Ro
24
109
0
07 Apr 2022
3D Question Answering
Shuquan Ye
Dongdong Chen
Songfang Han
Jing Liao
ViT
24
46
0
15 Dec 2021
Pano-AVQA: Grounded Audio-Visual Question Answering on 360
∘
^\circ
∘
Videos
Heeseung Yun
Youngjae Yu
Wonsuk Yang
Kangil Lee
Gunhee Kim
25
78
0
11 Oct 2021
Knowledge-based Embodied Question Answering
Sinan Tan
Mengmeng Ge
Di Guo
Huaping Liu
F. Sun
22
20
0
16 Sep 2021
The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation
Xiaoming Zhao
Harsh Agrawal
Dhruv Batra
A. Schwing
28
40
0
26 Aug 2021
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
39
45
0
26 Jun 2021
A Survey on Human-aware Robot Navigation
Ronja Möller
Antonino Furnari
S. Battiato
Aki Härmä
G. Farinella
31
87
0
22 Jun 2021
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images
Haolin Liu
Anran Lin
Xiaoguang Han
Lei Yang
Yizhou Yu
Shuguang Cui
24
39
0
14 Mar 2021
How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget
Erik Wijmans
Irfan Essa
Dhruv Batra
3DPC
17
10
0
11 Dec 2020
MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation
Saim Wani
Shivansh Patel
Unnat Jain
Angel X. Chang
Manolis Savva
32
103
0
07 Dec 2020
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling
Xin Ye
Yezhou Yang
17
14
0
16 Oct 2020
Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes' Rule
Shuhei Kurita
Kyunghyun Cho
LM&Ro
9
23
0
16 Sep 2020
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
Jacob Krantz
Erik Wijmans
Arjun Majumdar
Dhruv Batra
Stefan Lee
24
263
0
06 Apr 2020
An Exploration of Embodied Visual Exploration
Santhosh Kumar Ramakrishnan
Dinesh Jayaraman
Kristen Grauman
LM&Ro
25
98
0
07 Jan 2020
Simultaneous Mapping and Target Driven Navigation
G. Georgakis
Yimeng Li
Jana Kosecka
17
16
0
18 Nov 2019
Neural Modular Control for Embodied Question Answering
Abhishek Das
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
LM&Ro
129
127
0
26 Oct 2018
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
99
3,116
0
02 Dec 2016
CAD2RL: Real Single-Image Flight without a Single Real Image
Fereshteh Sadeghi
Sergey Levine
SSL
221
809
0
13 Nov 2016
1