ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.03461
  4. Cited By
Embodied Question Answering in Photorealistic Environments with Point
  Cloud Perception

Embodied Question Answering in Photorealistic Environments with Point Cloud Perception

6 April 2019
Erik Wijmans
Samyak Datta
Oleksandr Maksymets
Abhishek Das
Georgia Gkioxari
Stefan Lee
Irfan Essa
Devi Parikh
Dhruv Batra
    3DPC
    LM&Ro
ArXivPDFHTML

Papers citing "Embodied Question Answering in Photorealistic Environments with Point Cloud Perception"

33 / 33 papers shown
Title
Visual Environment-Interactive Planning for Embodied Complex-Question Answering
Visual Environment-Interactive Planning for Embodied Complex-Question Answering
Ning Lan
Baoshan Ou
Xuemei Xie
G. Shi
LM&Ro
66
1
0
01 Apr 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Kaixuan Jiang
Y. Liu
Weixing Chen
Jingzhou Luo
Ziliang Chen
Ling Pan
G. Li
Liang Lin
51
2
0
14 Mar 2025
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
Dongping Li
Tielong Cai
Tianci Tang
Wenhao Chai
Katherine Rose Driggs-Campbell
Gaoang Wang
LM&Ro
61
0
0
11 Mar 2025
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models
Yue Zhang
Zhiyang Xu
Ying Shen
Parisa Kordjamshidi
Lifu Huang
32
6
0
04 Oct 2024
Answerability Fields: Answerable Location Estimation via Diffusion
  Models
Answerability Fields: Answerable Location Estimation via Diffusion Models
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
M. Kawanabe
DiffM
48
0
0
26 Jul 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
71
41
0
23 May 2024
Following the Human Thread in Social Navigation
Following the Human Thread in Social Navigation
Luca Scofano
Alessio Sampieri
Tommaso Campari
Valentino Sacco
Indro Spinelli
Lamberto Ballan
Fabio Galasso
31
0
0
17 Apr 2024
An Outlook into the Future of Egocentric Vision
An Outlook into the Future of Egocentric Vision
Chiara Plizzari
Gabriele Goletto
Antonino Furnari
Siddhant Bansal
Francesco Ragusa
G. Farinella
Dima Damen
Tatiana Tommasi
EgoV
32
38
0
14 Aug 2023
MLANet: Multi-Level Attention Network with Sub-instruction for
  Continuous Vision-and-Language Navigation
MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation
Zongtao He
Liuyi Wang
Shu Li
Qingqing Yan
Chengju Liu
Qi Chen
19
7
0
02 Mar 2023
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved
  Visio-Linguistic Models in 3D Scenes
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio-Linguistic Models in 3D Scenes
Ahmed Abdelreheem
Kyle Olszewski
Hsin-Ying Lee
Peter Wonka
Panos Achlioptas
3DPC
20
28
0
12 Dec 2022
A General Purpose Supervisory Signal for Embodied Agents
A General Purpose Supervisory Signal for Embodied Agents
Kunal Pratap Singh
Jordi Salvador
Luca Weihs
Aniruddha Kembhavi
SSL
21
3
0
01 Dec 2022
AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments
AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments
Sudipta Paul
A. Roy-Chowdhury
A. Cherian
25
23
0
14 Oct 2022
Learning a Visually Grounded Memory Assistant
Learning a Visually Grounded Memory Assistant
Meera Hahn
Kevin Carlberg
Ruta Desai
James M. Hillis
17
1
0
07 Oct 2022
Iterative Vision-and-Language Navigation
Iterative Vision-and-Language Navigation
Jacob Krantz
Shurjo Banerjee
Wang Zhu
Jason J. Corso
Peter Anderson
Stefan Lee
Jesse Thomason
LM&Ro
40
18
0
06 Oct 2022
Episodic Memory Question Answering
Episodic Memory Question Answering
Samyak Datta
Sameer Dharur
Vincent Cartillier
Ruta Desai
Mukul Khanna
Dhruv Batra
Devi Parikh
EgoV
11
31
0
03 May 2022
Habitat-Web: Learning Embodied Object-Search Strategies from Human
  Demonstrations at Scale
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Ram Ramrakhya
Eric Undersander
Dhruv Batra
Abhishek Das
LM&Ro
24
109
0
07 Apr 2022
3D Question Answering
3D Question Answering
Shuquan Ye
Dongdong Chen
Songfang Han
Jing Liao
ViT
24
46
0
15 Dec 2021
Pano-AVQA: Grounded Audio-Visual Question Answering on 360$^\circ$
  Videos
Pano-AVQA: Grounded Audio-Visual Question Answering on 360∘^\circ∘ Videos
Heeseung Yun
Youngjae Yu
Wonsuk Yang
Kangil Lee
Gunhee Kim
25
78
0
11 Oct 2021
Knowledge-based Embodied Question Answering
Knowledge-based Embodied Question Answering
Sinan Tan
Mengmeng Ge
Di Guo
Huaping Liu
F. Sun
22
20
0
16 Sep 2021
The Surprising Effectiveness of Visual Odometry Techniques for Embodied
  PointGoal Navigation
The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation
Xiaoming Zhao
Harsh Agrawal
Dhruv Batra
A. Schwing
28
40
0
26 Aug 2021
Core Challenges in Embodied Vision-Language Planning
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
39
45
0
26 Jun 2021
A Survey on Human-aware Robot Navigation
A Survey on Human-aware Robot Navigation
Ronja Möller
Antonino Furnari
S. Battiato
Aki Härmä
G. Farinella
31
87
0
22 Jun 2021
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD
  Images
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images
Haolin Liu
Anran Lin
Xiaoguang Han
Lei Yang
Yizhou Yu
Shuguang Cui
24
39
0
14 Mar 2021
How to Train PointGoal Navigation Agents on a (Sample and Compute)
  Budget
How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget
Erik Wijmans
Irfan Essa
Dhruv Batra
3DPC
17
10
0
11 Dec 2020
MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation
MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation
Saim Wani
Shivansh Patel
Unnat Jain
Angel X. Chang
Manolis Savva
32
103
0
07 Dec 2020
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning
  with Intrinsic-Extrinsic Modeling
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling
Xin Ye
Yezhou Yang
17
14
0
16 Oct 2020
Generative Language-Grounded Policy in Vision-and-Language Navigation
  with Bayes' Rule
Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes' Rule
Shuhei Kurita
Kyunghyun Cho
LM&Ro
9
23
0
16 Sep 2020
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous
  Environments
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
Jacob Krantz
Erik Wijmans
Arjun Majumdar
Dhruv Batra
Stefan Lee
24
263
0
06 Apr 2020
An Exploration of Embodied Visual Exploration
An Exploration of Embodied Visual Exploration
Santhosh Kumar Ramakrishnan
Dinesh Jayaraman
Kristen Grauman
LM&Ro
25
98
0
07 Jan 2020
Simultaneous Mapping and Target Driven Navigation
Simultaneous Mapping and Target Driven Navigation
G. Georgakis
Yimeng Li
Jana Kosecka
17
16
0
18 Nov 2019
Neural Modular Control for Embodied Question Answering
Neural Modular Control for Embodied Question Answering
Abhishek Das
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
LM&Ro
129
127
0
26 Oct 2018
Making the V in VQA Matter: Elevating the Role of Image Understanding in
  Visual Question Answering
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
99
3,116
0
02 Dec 2016
CAD2RL: Real Single-Image Flight without a Single Real Image
CAD2RL: Real Single-Image Flight without a Single Real Image
Fereshteh Sadeghi
Sergey Levine
SSL
221
809
0
13 Nov 2016
1