Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2109.03413
Cited By
v1
v2 (latest)
YouRefIt: Embodied Reference Understanding with Language and Gesture
IEEE International Conference on Computer Vision (ICCV), 2021
8 September 2021
Yixin Chen
Qing Li
Deqian Kong
Yik Lun Kei
Song-Chun Zhu
Tao Gao
Yixin Zhu
Siyuan Huang
LM&Ro
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"YouRefIt: Embodied Reference Understanding with Language and Gesture"
13 / 13 papers shown
A Multimodal Depth-Aware Method For Embodied Reference Understanding
Fevziye Irem Eyiokur
Dogucan Yaman
H. K. Ekenel
Alexander Waibel
ObjD
380
0
0
09 Oct 2025
Learning to Generate Pointing Gestures in Situated Embodied Conversational Agents
Frontiers in Robotics and AI (Front. Robot. AI), 2023
Anna Deichler
Siyang Wang
Simon Alexanderson
Jonas Beskow
256
15
0
15 Sep 2025
Multimodal Data Storage and Retrieval for Embodied AI: A Survey
Yihao Lu
Hao Tang
157
3
0
19 Aug 2025
CAPE: A CLIP-Aware Pointing Ensemble of Complementary Heatmap Cues for Embodied Reference Understanding
Fevziye Irem Eyiokur
Dogucan Yaman
H. K. Ekenel
Alexander Waibel
301
0
0
29 Jul 2025
I see what you mean: Co-Speech Gestures for Reference Resolution in Multimodal Dialogue
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
E. Ghaleb
Bulat Khaertdinov
Aslı Özyürek
Raquel Fernández
401
0
0
27 Feb 2025
GSVA: Generalized Segmentation via Multimodal Large Language Models
Computer Vision and Pattern Recognition (CVPR), 2023
Zhuofan Xia
Dongchen Han
Yizeng Han
Xuran Pan
Shiji Song
Gao Huang
VLM
682
152
0
15 Dec 2023
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding
European Conference on Computer Vision (ECCV), 2023
Cheng Shi
Sibei Yang
LRM
195
14
0
03 Sep 2023
MEWL: Few-shot multimodal word learning with referential uncertainty
International Conference on Machine Learning (ICML), 2023
Guangyuan Jiang
Manjie Xu
Shiji Xin
Weihan Liang
Yujia Peng
Fangqiu Yi
Yixin Zhu
OffRL
348
29
0
01 Jun 2023
STRAP: Structured Object Affordance Segmentation with Point Supervision
Lei Cui
Xiaoxue Chen
Hao Zhao
Guyue Zhou
Yixin Zhu
3DPC
287
10
0
17 Apr 2023
ULN: Towards Underspecified Vision-and-Language Navigation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Weixi Feng
Tsu-Jui Fu
Yujie Lu
William Yang Wang
325
5
0
18 Oct 2022
HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes
Neural Information Processing Systems (NeurIPS), 2022
Zan Wang
Yixin Chen
Tengyu Liu
Yixin Zhu
Wei Liang
Siyuan Huang
264
178
0
18 Oct 2022
Understanding Embodied Reference with Touch-Line Transformer
International Conference on Learning Representations (ICLR), 2022
Yongqian Li
Xiaoxue Chen
Hao Zhao
Jiangtao Gong
Guyue Zhou
Federico Rossano
Yixin Zhu
366
21
0
11 Oct 2022
Distance-Aware Occlusion Detection with Focused Attention
IEEE Transactions on Image Processing (IEEE TIP), 2022
Yongqian Li
Yucheng Tu
Xiaoxue Chen
Hao Zhao
Guyue Zhou
205
9
0
23 Aug 2022
1
Page 1 of 1