Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2406.05132
Cited By
v1
v2
v3 (latest)
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
Computer Vision and Pattern Recognition (CVPR), 2024
7 June 2024
Jianing Yang
Xuweiyi Chen
Nikhil Madaan
Madhavan Iyengar
Shengyi Qian
David Fouhey
Joyce Chai
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (31 upvotes)
Papers citing
"3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination"
31 / 81 papers shown
3D Concept Learning and Reasoning from Multi-View Images
Computer Vision and Pattern Recognition (CVPR), 2023
Yining Hong
Chun-Tse Lin
Yilun Du
Zhenfang Chen
J. Tenenbaum
Chuang Gan
3DV
274
72
0
20 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
International Conference on Machine Learning (ICML), 2023
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
1.3K
6,661
0
30 Jan 2023
Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments
IEEE Robotics and Automation Letters (RA-L), 2023
Mayank Mittal
C. Yu
Qinxi Yu
Jingzhou Liu
Nikita Rudin
...
Ajay Mandlekar
Buck Babich
Gavriel State
Marco Hutter
Animesh Garg
311
409
0
10 Jan 2023
Is GPT-3 a Good Data Annotator?
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Bosheng Ding
Chengwei Qin
Linlin Liu
Yew Ken Chia
Shafiq Joty
Boyang Albert Li
Lidong Bing
325
307
0
20 Dec 2022
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio-Linguistic Models in 3D Scenes
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Ahmed Abdelreheem
Kyle Olszewski
Hsin-Ying Lee
Peter Wonka
Panos Achlioptas
3DPC
267
32
0
12 Dec 2022
Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Wenliang Dai
Zihan Liu
Ziwei Ji
Jane Polak Scowcroft
Pascale Fung
MLLM
VLM
302
75
0
14 Oct 2022
SQA3D: Situated Question Answering in 3D Scenes
International Conference on Learning Representations (ICLR), 2022
Xiaojian Ma
Silong Yong
Zilong Zheng
Qing Li
Yitao Liang
Song-Chun Zhu
Siyuan Huang
LM&Ro
505
245
0
14 Oct 2022
Mask3D: Mask Transformer for 3D Semantic Instance Segmentation
IEEE International Conference on Robotics and Automation (ICRA), 2022
Jonas Schult
Francis Engelmann
Alexander Hermans
Or Litany
Siyu Tang
Bastian Leibe
ISeg
321
301
0
06 Oct 2022
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
Zhihao Yuan
Xu Yan
Zhuo Li
Xuhao Li
Yao Guo
Shuguang Cui
Zhen Li
188
18
0
05 Jul 2022
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Matt Deitke
Eli VanderBilt
Alvaro Herrasti
Luca Weihs
Jordi Salvador
...
Winson Han
Eric Kolve
Ali Farhadi
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
318
367
0
14 Jun 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Neural Information Processing Systems (NeurIPS), 2022
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLM
VLM
695
4,826
0
29 Apr 2022
Language-Grounded Indoor 3D Semantic Segmentation in the Wild
European Conference on Computer Vision (ECCV), 2022
Dávid Rozenberszki
Or Litany
Angela Dai
3DV
VLM
385
257
0
16 Apr 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
2.3K
14,449
0
28 Jan 2022
ScanQA: 3D Question Answering for Spatial Scene Understanding
Computer Vision and Pattern Recognition (CVPR), 2021
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
M. Kawanabe
436
325
0
20 Dec 2021
Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Neural Information Processing Systems (NeurIPS), 2021
Andrew Szot
Alexander Clegg
Eric Undersander
Erik Wijmans
Yili Zhao
...
Z. Kira
V. Koltun
Jitendra Malik
Manolis Savva
Dhruv Batra
LM&Ro
393
638
0
28 Jun 2021
LoRA: Low-Rank Adaptation of Large Language Models
International Conference on Learning Representations (ICLR), 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
1.6K
15,273
0
17 Jun 2021
Grounding 'Grounding' in NLP
Findings (Findings), 2021
Khyathi Chandu
Yonatan Bisk
A. Black
167
57
0
04 Jun 2021
ManipulaTHOR: A Framework for Visual Object Manipulation
Computer Vision and Pattern Recognition (CVPR), 2021
Kiana Ehsani
Winson Han
Alvaro Herrasti
Eli VanderBilt
Luca Weihs
Eric Kolve
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
845
151
0
22 Apr 2021
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Computer Vision and Pattern Recognition (CVPR), 2020
Dave Zhenyu Chen
A. Gholami
Matthias Nießner
Angel X. Chang
3DPC
305
230
0
03 Dec 2020
3D-FRONT: 3D Furnished Rooms with layOuts and semaNTics
IEEE International Conference on Computer Vision (ICCV), 2020
Huan Fu
Bowen Cai
Lin Gao
Ling-Xiao Zhang
Ying Li
Zengqi Xun
Chengyue Sun
Rongfei Jia
Binqiang Zhao
H. Zhang
3DV
289
354
0
18 Nov 2020
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
499
399
0
21 Apr 2020
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform
Computer Vision and Pattern Recognition (CVPR), 2020
Matt Deitke
Winson Han
Alvaro Herrasti
Aniruddha Kembhavi
Eric Kolve
...
Eli VanderBilt
Matthew Wallingford
Luca Weihs
Mark Yatskar
Ali Farhadi
LM&Ro
293
280
0
14 Apr 2020
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
European Conference on Computer Vision (ECCV), 2019
Dave Zhenyu Chen
Angel X. Chang
Matthias Nießner
3DPC
421
507
0
18 Dec 2019
Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
European Conference on Computer Vision (ECCV), 2019
Jia Zheng
Junfei Zhang
Jing Li
Rui Tang
Shenghua Gao
Zihan Zhou
3DV
354
346
0
01 Aug 2019
Habitat: A Platform for Embodied AI Research
Manolis Savva
Abhishek Kadian
Oleksandr Maksymets
Yili Zhao
Erik Wijmans
...
Jia-Wei Liu
V. Koltun
Jitendra Malik
Devi Parikh
Dhruv Batra
LM&Ro
570
1,676
0
02 Apr 2019
Unity: A General Platform for Intelligent Agents
Arthur Juliani
Vincent-Pierre Berges
Esh Vckay
Andrew Cohen
Jonathan Harper
...
Chris Goy
Yuan Gao
Hunter Henry
Marwan Mattar
Danny Lange
402
907
0
07 Sep 2018
Object Hallucination in Image Captioning
Anna Rohrbach
Lisa Anne Hendricks
Kaylee Burns
Trevor Darrell
Kate Saenko
425
593
0
06 Sep 2018
AI2-THOR: An Interactive 3D Environment for Visual AI
Eric Kolve
Roozbeh Mottaghi
Winson Han
Eli VanderBilt
Luca Weihs
...
Daniel Gordon
Yuke Zhu
Aniruddha Kembhavi
Abhinav Gupta
Ali Farhadi
LM&Ro
679
1,295
0
14 Dec 2017
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
Computer Vision and Pattern Recognition (CVPR), 2017
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
3DPC
3DV
1.3K
4,902
0
14 Feb 2017
Modeling Context in Referring Expressions
Licheng Yu
Patrick Poirson
Shan Yang
Alexander C. Berg
Tamara L. Berg
570
1,522
0
31 Jul 2016
Microsoft COCO: Common Objects in Context
European Conference on Computer Vision (ECCV), 2014
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
17.5K
49,453
0
01 May 2014
Previous
1
2