ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.12514
  4. Cited By
Language Grounding with 3D Objects

Language Grounding with 3D Objects

26 July 2021
Jesse Thomason
Mohit Shridhar
Yonatan Bisk
Chris Paxton
Luke Zettlemoyer
    LM&Ro
ArXivPDFHTML

Papers citing "Language Grounding with 3D Objects"

18 / 18 papers shown
Title
Why Are You Wrong? Counterfactual Explanations for Language Grounding with 3D Objects
Why Are You Wrong? Counterfactual Explanations for Language Grounding with 3D Objects
Tobias Preintner
Weixuan Yuan
Qi Huang
Adrian König
Thomas Bäck
E. Raponi
N. V. Stein
29
0
0
09 May 2025
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour
David Harrison
Maxwell Horton
Jeffrey Marker
Houman Bedayat
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
Saman Naderiparizi
MQ
57
0
0
14 Oct 2024
ET tu, CLIP? Addressing Common Object Errors for Unseen Environments
ET tu, CLIP? Addressing Common Object Errors for Unseen Environments
Ye Won Byun
Cathy Jiao
Shahriar Noroozizadeh
Jimin Sun
Rosa Vitiello
VLM
44
1
0
25 Jun 2024
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images
Han-Hung Lee
Yiming Zhang
Angel X. Chang
3DPC
45
3
0
17 Jun 2024
FoundationGrasp: Generalizable Task-Oriented Grasping with Foundation
  Models
FoundationGrasp: Generalizable Task-Oriented Grasping with Foundation Models
Chao Tang
Dehao Huang
Wenlong Dong
Ruinian Xu
Hong Zhang
36
9
0
16 Apr 2024
DetermiNet: A Large-Scale Diagnostic Dataset for Complex
  Visually-Grounded Referencing using Determiners
DetermiNet: A Large-Scale Diagnostic Dataset for Complex Visually-Grounded Referencing using Determiners
Clarence Lee
M Ganesh Kumar
Cheston Tan
28
3
0
07 Sep 2023
Paparazzi: A Deep Dive into the Capabilities of Language and Vision
  Models for Grounding Viewpoint Descriptions
Paparazzi: A Deep Dive into the Capabilities of Language and Vision Models for Grounding Viewpoint Descriptions
Henrik Voigt
J. Hombeck
M. Meuschke
K. Lawonn
Sina Zarrieß
VLM
30
1
0
13 Feb 2023
Learning 6-DoF Fine-grained Grasp Detection Based on Part Affordance Grounding
Learning 6-DoF Fine-grained Grasp Detection Based on Part Affordance Grounding
Yaoxian Song
Penglei Sun
Piaopiao Jin
Yi Ren
Yu Zheng
Zhixu Li
Xiaowen Chu
Yueying Zhang
Tiefeng Li
Jason Gu
63
14
0
27 Jan 2023
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved
  Visio-Linguistic Models in 3D Scenes
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio-Linguistic Models in 3D Scenes
Ahmed Abdelreheem
Kyle Olszewski
Hsin-Ying Lee
Peter Wonka
Panos Achlioptas
3DPC
22
28
0
12 Dec 2022
LADIS: Language Disentanglement for 3D Shape Editing
LADIS: Language Disentanglement for 3D Shape Editing
Ian Huang
Panos Achlioptas
Tianyi Zhang
Sergey Tulyakov
Minhyuk Sung
Leonidas J. Guibas
26
10
0
09 Dec 2022
HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes
HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes
Zan Wang
Yixin Chen
Tengyu Liu
Yixin Zhu
Wei Liang
Siyuan Huang
40
103
0
18 Oct 2022
Music-to-Text Synaesthesia: Generating Descriptive Text from Music
  Recordings
Music-to-Text Synaesthesia: Generating Descriptive Text from Music Recordings
Zhihuan Kuang
Shi Zong
Jianbing Zhang
Jiajun Chen
Hongfu Liu
22
4
0
02 Oct 2022
Robots Enact Malignant Stereotypes
Robots Enact Malignant Stereotypes
Andrew Hundt
William Agnew
V. Zeng
Severin Kacianka
Matthew C. Gombolay
LM&Ro
35
41
0
23 Jul 2022
Voxel-informed Language Grounding
Voxel-informed Language Grounding
Rodolfo Corona
Shizhan Zhu
Dan Klein
Trevor Darrell
133
11
0
19 May 2022
TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval
TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval
Yue Ruan
Han-Hung Lee
Yiming Zhang
Ke Zhang
Angel X. Chang
32
22
0
19 Jan 2022
PartGlot: Learning Shape Part Segmentation from Language Reference Games
PartGlot: Learning Shape Part Segmentation from Language Reference Games
Juil Koo
Ian Huang
Panos Achlioptas
Leonidas J. Guibas
Minhyuk Sung
3DPC
30
28
0
13 Dec 2021
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning
  and Visual Grounding
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Qirui Wu
Matthias Nießner
Angel X. Chang
21
29
0
02 Dec 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
VinVL: Revisiting Visual Representations in Vision-Language Models
Pengchuan Zhang
Xiujun Li
Xiaowei Hu
Jianwei Yang
Lei Zhang
Lijuan Wang
Yejin Choi
Jianfeng Gao
ObjD
VLM
260
157
0
02 Jan 2021
1