2409.19457
Cited By
A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
IEEE International Conference on Robotics and Automation (ICRA), 2024
28 September 2024
Houjian Yu
Mingen Li
Alireza Rezazadeh
Yang Yang
Changhyun Choi
Papers citing
"A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping"
50 / 52 papers shown
Title
BOP-ASK: Object-Interaction Reasoning for Vision-Language Models
V. Bhat
Sungsu Kim
Valts Blukis
Greg Heinrich
Prashanth Krishnamurthy
Ramesh Karri
Stan Birchfield
Farshad Khorrami
Jonathan Tremblay
VLM
189
1
0
20 Nov 2025
LACY: A Vision-Language Model-based Language-Action Cycle for Self-Improving Robotic Manipulation
Youngjin Hong
Houjian Yu
Mingen Li
Changhyun Choi
LM&Ro
181
0
0
04 Nov 2025
Attribute-based Object Grounding and Robot Grasp Detection with Spatial Reasoning
Houjian Yu
Zheming Zhou
Min Sun
Omid Ghasemalizadeh
Yuyin Sun
Cheng-Hao Kuo
Arnie Sen
Changhyun Choi
82
0
0
09 Sep 2025
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
330
10
0
01 Aug 2025
MapleGrasp: Mask-guided Feature Pooling for Language-driven Efficient Robotic Grasping
V. Bhat
Naman Patel
Prashanth Krishnamurthy
Ramesh Karri
Farshad Khorrami
226
0
0
06 Jun 2025
GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions
Xiaomeng Chu
Jiajun Deng
Guoliang You
Wei Liu
Xuzhao Li
Jianmin Ji
Yanzhe Zhang
292
1
0
20 Mar 2025
Attribute-Based Robotic Grasping with Data-Efficient Adaptation
IEEE Transactions on Robotics (IEEE TRO), 2025
Yang Yang
Houjian Yu
Xibai Lou
Yuanhao Liu
Changhyun Choi
333
20
0
04 Jan 2025
ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter
Yaoyao Qian
Xu Zhu
Ondrej Biza
Shuo Jiang
Linfeng Zhao
Hao-zhe Huang
Yu Qi
Robert Platt
201
30
0
16 Jul 2024
Language-driven Grasp Detection
An Dinh Vuong
Minh Nhat Vu
Baoru Huang
Nghia Nguyen
Hieu Le
T. Vo
Anh Nguyen
VLM
297
29
0
13 Jun 2024
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Siyuan Huang
Iaroslav Ponomarenko
Zhengkai Jiang
Xiaoqi Li
Xiaobin Hu
Shiyang Feng
Jiaming Song
Hao Dong
LM&Ro
294
38
0
17 Mar 2024
Reasoning Grasping via Multimodal Large Language Model
Conference on Robot Learning (CoRL), 2024
Shiyu Jin
Jinxuan Xu
Yutian Lei
Liangjun Zhang
LRM
222
35
0
09 Feb 2024
Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in Clutter
Conference on Robot Learning (CoRL), 2023
Georgios Tziafas
Yucheng Xu
Arushi Goel
Mohammadreza Kasaei
Zhibin Li
Hamidreza Kasaei
187
37
0
09 Nov 2023
Adversarial Object Rearrangement in Constrained Environments with Heterogeneous Graph Neural Networks
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Xibai Lou
Houjian Yu
Ross Worobel
Yang Yang
Changhyun Choi
191
7
0
27 Sep 2023
Beyond One-to-One: Rethinking the Referring Image Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Yutao Hu
Qixiong Wang
Wenqi Shao
Enze Xie
Zhenguo Li
Jungong Han
Ping Luo
3DV
195
63
0
26 Aug 2023
EAVL: Explicitly Align Vision and Language for Referring Image Segmentation
Yimin Yan
Xingjian He
Wenxuan Wang
Sihan Chen
Qingbin Liu
ObjD
VLM
254
2
0
18 Aug 2023
IOSG: Image-driven Object Searching and Grasping
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Houjian Yu
Xibai Lou
Yang Yang
Changhyun Choi
117
7
0
10 Aug 2023
VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Yuhao Lu
Yixuan Fan
Beixing Deng
Fan Liu
Yali Li
Shengjin Wang
222
55
0
01 Aug 2023
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Zunnan Xu
Zhihong Chen
Yong Zhang
Yibing Song
Xiang Wan
Guanbin Li
VLM
184
68
0
21 Jul 2023
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Edouard Grave
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
1.0K
5,722
0
14 Apr 2023
Task-Oriented Grasp Prediction with Visual-Language Inputs
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Chao Tang
Dehao Huang
Lingxiao Meng
Weiyu Liu
Kuanqi Cai
177
44
0
28 Feb 2023
A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter
IEEE International Conference on Robotics and Automation (ICRA), 2023
Kechun Xu
Shuqing Zhao
Zhongxiang Zhou
Zizhang Li
Huaijin Pi
Yifeng Zhu
Yue Wang
R. Xiong
198
62
0
24 Feb 2023
Position-Aware Contrastive Alignment for Referring Image Segmentation
Bo Chen
Zhiwei Hu
Zhilong Ji
Jinfeng Bai
W. Zuo
199
9
0
27 Dec 2022
Self-Supervised Interactive Object Segmentation Through a Singulation-and-Grasping Approach
European Conference on Computer Vision (ECCV), 2022
Houjian Yu
Changhyun Choi
215
15
0
19 Jul 2022
Convolutional Bypasses Are Better Vision Transformer Adapters
European Conference on Artificial Intelligence (ECAI), 2022
Shibo Jie
Zhi-Hong Deng
VPVLM
229
156
0
14 Jul 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Neural Information Processing Systems (NeurIPS), 2022
Shoufa Chen
Chongjian Ge
Zhan Tong
Jiangliu Wang
Yibing Song
Jue Wang
Ping Luo
478
911
0
26 May 2022
Learning 6-DoF Object Poses to Grasp Category-level Objects by Language Instructions
IEEE International Conference on Robotics and Automation (ICRA), 2022
Chi-Hou Cheang
Haitao Lin
Yanwei Fu
Xiangyang Xue
159
27
0
09 May 2022
ReSTR: Convolution-free Referring Image Segmentation Using Transformers
Computer Vision and Pattern Recognition (CVPR), 2022
N. Kim
Dongwon Kim
Cuiling Lan
Wenjun Zeng
Suha Kwak
287
175
0
31 Mar 2022
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
Computer Vision and Pattern Recognition (CVPR), 2021
Zhao Yang
Yuan Liu
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Juil Sock
665
410
0
04 Dec 2021
CRIS: CLIP-Driven Referring Image Segmentation
Zhaoqing Wang
Yu Lu
Qiang Li
Xunqiang Tao
Yan Guo
Ming Gong
Tongliang Liu
VLM
385
438
0
30 Nov 2021
CLIPort: What and Where Pathways for Robotic Manipulation
Conference on Robot Learning (CoRL), 2021
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
270
800
0
24 Sep 2021
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
1.1K
3,229
0
02 Sep 2021
Vision-Language Transformer and Query Generation for Referring Segmentation
IEEE International Conference on Computer Vision (ICCV), 2021
Henghui Ding
Chang-rui Liu
Suchen Wang
Xudong Jiang
237
321
0
12 Aug 2021
LoRA: Low-Rank Adaptation of Large Language Models
International Conference on Learning Representations (ICLR), 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
1.5K
14,789
0
17 Jun 2021
Attribute-Based Robotic Grasping with One-Grasp Adaptation
IEEE International Conference on Robotics and Automation (ICRA), 2021
Yang Yang
Yuanhao Liu
Hengyue Liang
Xibai Lou
Changhyun Choi
117
18
0
06 Apr 2021
Collision-Aware Target-Driven Object Grasping in Constrained Environments
IEEE International Conference on Robotics and Automation (ICRA), 2021
Xibai Lou
Yang Yang
Changhyun Choi
166
36
0
01 Apr 2021
Efficient learning of goal-oriented push-grasping synergy in clutter
IEEE Robotics and Automation Letters (RA-L), 2021
Kechun Xu
Hongxiang Yu
Qianen Lai
Yue Wang
R. Xiong
231
85
0
09 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
International Conference on Machine Learning (ICML), 2021
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
2.0K
39,913
0
26 Feb 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Xiang Lisa Li
Percy Liang
642
5,102
0
01 Jan 2021
Visually Grounding Language Instruction for History-Dependent Manipulation
IEEE International Conference on Robotics and Automation (ICRA), 2020
Hyemin Ahn
Obin Kwon
Kyungdo Kim
Jaeyeon Jeong
Howoong Jun
Hongjung Lee
Dongheui Lee
Songhwai Oh
LM&Ro
203
7
0
16 Dec 2020
Exploring Simple Siamese Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2020
Xinlei Chen
Kaiming He
SSL
668
4,618
0
20 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
1.3K
53,494
0
22 Oct 2020
Unseen Object Instance Segmentation for Robotic Environments
IEEE Transactions on Robotics (IEEE Trans. Robot.), 2020
Christopher Xie
Yu Xiang
Arsalan Mousavian
Dieter Fox
285
146
0
16 Jul 2020
Mechanical Search: Multi-Step Retrieval of a Target Object Occluded by Clutter
IEEE International Conference on Robotics and Automation (ICRA), 2019
Michael Danielczuk
Andrey Kurenkov
Ashwin Balakrishna
Matthew Matl
David Wang
Roberto Martín-Martín
Animesh Garg
Silvio Savarese
Ken Goldberg
135
119
0
04 Mar 2019
Parameter-Efficient Transfer Learning for NLP
International Conference on Machine Learning (ICML), 2019
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
569
5,518
0
02 Feb 2019
Jacquard: A Large Scale Dataset for Robotic Grasp Detection
Amaury Depierre
Emmanuel Dellandrea
Liming Chen
258
363
0
30 Mar 2018
MAttNet: Modular Attention Network for Referring Expression Comprehension
Licheng Yu
Zhe Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Joey Tianyi Zhou
Tamara L. Berg
ObjD
389
905
0
24 Jan 2018
Attention Is All You Need
Neural Information Processing Systems (NeurIPS), 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
2.5K
157,684
0
12 Jun 2017
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
989
8,666
0
02 Dec 2016
Modeling Context Between Objects for Referring Expression Understanding
Varun K. Nagaraja
Vlad I. Morariu
Larry S. Davis
237
224
0
01 Aug 2016
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLM
EgoV
199
501
0
20 Mar 2016