ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.03413
  4. Cited By
YouRefIt: Embodied Reference Understanding with Language and Gesture

YouRefIt: Embodied Reference Understanding with Language and Gesture

8 September 2021
Yixin Chen
Qing Li
Deqian Kong
Yik Lun Kei
Song-Chun Zhu
Tao Gao
Yixin Zhu
Siyuan Huang
    LM&Ro
ArXivPDFHTML

Papers citing "YouRefIt: Embodied Reference Understanding with Language and Gesture"

24 / 24 papers shown
Title
Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding
Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding
Hao Guo
Jianfei Zhu
Wei Fan
Chunzhi Yi
Feng Jiang
ObjD
63
0
0
25 Mar 2025
I see what you mean: Co-Speech Gestures for Reference Resolution in Multimodal Dialogue
E. Ghaleb
Bulat Khaertdinov
Aslı Özyürek
Raquel Fernández
34
0
0
27 Feb 2025
AD-DINO: Attention-Dynamic DINO for Distance-Aware Embodied Reference
  Understanding
AD-DINO: Attention-Dynamic DINO for Distance-Aware Embodied Reference Understanding
Hao Guo
Wei Fan
Baichun Wei
Jianfei Zhu
Jin Tian
Chunzhi Yi
Feng Jiang
34
0
0
13 Nov 2024
Robi Butler: Multimodal Remote Interaction with a Household Robot Assistant
Robi Butler: Multimodal Remote Interaction with a Household Robot Assistant
Anxing Xiao
Nuwan Janaka
Tianrun Hu
Anshul Gupta
Kaixin Li
Cunjun Yu
David Hsu
LM&Ro
32
3
0
30 Sep 2024
SYNERGAI: Perception Alignment for Human-Robot Collaboration
SYNERGAI: Perception Alignment for Human-Robot Collaboration
Yixin Chen
Guoxi Zhang
Yaowei Zhang
Hongming Xu
Peiyuan Zhi
Qing Li
Siyuan Huang
32
0
0
24 Sep 2024
Investigating the Role of Instruction Variety and Task Difficulty in
  Robotic Manipulation Tasks
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Amit Parekh
Nikolas Vitsakis
Alessandro Suglia
Ioannis Konstas
AAML
33
5
0
04 Jul 2024
Generating Human Motion in 3D Scenes from Text Descriptions
Generating Human Motion in 3D Scenes from Text Descriptions
Zhi Cen
Huaijin Pi
Sida Peng
Zehong Shen
Minghui Yang
Shuai Zhu
Hujun Bao
Xiaowei Zhou
36
19
0
13 May 2024
Move as You Say, Interact as You Can: Language-guided Human Motion
  Generation with Scene Affordance
Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
Zan Wang
Yixin Chen
Baoxiong Jia
Puhao Li
Jinlu Zhang
Jingze Zhang
Tengyu Liu
Yixin Zhu
Wei Liang
Siyuan Huang
VGen
DiffM
39
36
0
26 Mar 2024
GSVA: Generalized Segmentation via Multimodal Large Language Models
GSVA: Generalized Segmentation via Multimodal Large Language Models
Zhuofan Xia
Dongchen Han
Yizeng Han
Xuran Pan
Shiji Song
Gao Huang
VLM
23
54
0
15 Dec 2023
ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
Jieming Cui
Ziren Gong
Baoxiong Jia
Siyuan Huang
Zilong Zheng
Jianzhu Ma
Yixin Zhu
25
3
0
01 Nov 2023
HandMeThat: Human-Robot Communication in Physical and Social
  Environments
HandMeThat: Human-Robot Communication in Physical and Social Environments
Yanming Wan
Jiayuan Mao
J. Tenenbaum
44
16
0
05 Oct 2023
DetermiNet: A Large-Scale Diagnostic Dataset for Complex
  Visually-Grounded Referencing using Determiners
DetermiNet: A Large-Scale Diagnostic Dataset for Complex Visually-Grounded Referencing using Determiners
Clarence Lee
M Ganesh Kumar
Cheston Tan
28
3
0
07 Sep 2023
Gesture-Informed Robot Assistance via Foundation Models
Gesture-Informed Robot Assistance via Foundation Models
Li-Heng Lin
Yuchen Cui
Yilun Hao
Fei Xia
Dorsa Sadigh
LM&Ro
SLR
13
19
0
06 Sep 2023
Spatial and Visual Perspective-Taking via View Rotation and Relation
  Reasoning for Embodied Reference Understanding
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding
Cheng Shi
Sibei Yang
LRM
21
6
0
03 Sep 2023
MEWL: Few-shot multimodal word learning with referential uncertainty
MEWL: Few-shot multimodal word learning with referential uncertainty
Guangyuan Jiang
Manjie Xu
Shiji Xin
Weihan Liang
Yujia Peng
Chi Zhang
Yixin Zhu
OffRL
21
16
0
01 Jun 2023
STRAP: Structured Object Affordance Segmentation with Point Supervision
STRAP: Structured Object Affordance Segmentation with Point Supervision
Lei Cui
Xiaoxue Chen
Hao Zhao
Guyue Zhou
Yixin Zhu
3DPC
28
6
0
17 Apr 2023
ScanERU: Interactive 3D Visual Grounding based on Embodied Reference
  Understanding
ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding
Ziyang Lu
Yunqiang Pei
Guoqing Wang
Yang Yang
Zheng Wang
Heng Tao Shen
46
6
0
23 Mar 2023
ULN: Towards Underspecified Vision-and-Language Navigation
ULN: Towards Underspecified Vision-and-Language Navigation
Weixi Feng
Tsu-jui Fu
Yujie Lu
William Yang Wang
31
4
0
18 Oct 2022
HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes
HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes
Zan Wang
Yixin Chen
Tengyu Liu
Yixin Zhu
Wei Liang
Siyuan Huang
29
103
0
18 Oct 2022
Understanding Embodied Reference with Touch-Line Transformer
Understanding Embodied Reference with Touch-Line Transformer
Y. Li
Xiaoxue Chen
Hao Zhao
Jiangtao Gong
Guyue Zhou
Federico Rossano
Yixin Zhu
156
15
0
11 Oct 2022
Distance-Aware Occlusion Detection with Focused Attention
Distance-Aware Occlusion Detection with Focused Attention
Y. Li
Yucheng Tu
Xiaoxue Chen
Hao Zhao
Guyue Zhou
20
6
0
23 Aug 2022
Communicative Learning with Natural Gestures for Embodied Navigation
  Agents with Human-in-the-Scene
Communicative Learning with Natural Gestures for Embodied Navigation Agents with Human-in-the-Scene
Qi Wu
Cheng-Ju Wu
Yixin Zhu
Jungseock Joo
38
14
0
05 Aug 2021
Multi-task Collaborative Network for Joint Referring Expression
  Comprehension and Segmentation
Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Liujuan Cao
Chenglin Wu
Cheng Deng
Rongrong Ji
ObjD
159
286
0
19 Mar 2020
Convolutional LSTM Network: A Machine Learning Approach for
  Precipitation Nowcasting
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
201
7,902
0
13 Jun 2015
1