ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2409.19457
  4. Cited By
A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
v1v2v3v4 (latest)

A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping

IEEE International Conference on Robotics and Automation (ICRA), 2024
28 September 2024
Houjian Yu
Mingen Li
Alireza Rezazadeh
Yang Yang
Changhyun Choi
ArXiv (abs)PDFHTML

Papers citing "A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping"

50 / 52 papers shown
Title
BOP-ASK: Object-Interaction Reasoning for Vision-Language Models
BOP-ASK: Object-Interaction Reasoning for Vision-Language Models
V. Bhat
Sungsu Kim
Valts Blukis
Greg Heinrich
Prashanth Krishnamurthy
Ramesh Karri
Stan Birchfield
Farshad Khorrami
Jonathan Tremblay
VLM
189
1
0
20 Nov 2025
LACY: A Vision-Language Model-based Language-Action Cycle for Self-Improving Robotic Manipulation
LACY: A Vision-Language Model-based Language-Action Cycle for Self-Improving Robotic Manipulation
Youngjin Hong
Houjian Yu
Mingen Li
Changhyun Choi
LM&Ro
181
0
0
04 Nov 2025
Attribute-based Object Grounding and Robot Grasp Detection with Spatial Reasoning
Attribute-based Object Grounding and Robot Grasp Detection with Spatial Reasoning
Houjian Yu
Zheming Zhou
Min Sun
Omid Ghasemalizadeh
Yuyin Sun
Cheng-Hao Kuo
Arnie Sen
Changhyun Choi
82
0
0
09 Sep 2025
Multimodal Referring Segmentation: A Survey
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
330
10
0
01 Aug 2025
MapleGrasp: Mask-guided Feature Pooling for Language-driven Efficient Robotic Grasping
MapleGrasp: Mask-guided Feature Pooling for Language-driven Efficient Robotic Grasping
V. Bhat
Naman Patel
Prashanth Krishnamurthy
Ramesh Karri
Farshad Khorrami
226
0
0
06 Jun 2025
GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions
GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions
Xiaomeng Chu
Jiajun Deng
Guoliang You
Wei Liu
Xuzhao Li
Jianmin Ji
Yanzhe Zhang
292
1
0
20 Mar 2025
Attribute-Based Robotic Grasping with Data-Efficient Adaptation
Attribute-Based Robotic Grasping with Data-Efficient AdaptationIEEE Transactions on robotics (IEEE TRO), 2025
Yang Yang
Houjian Yu
Xibai Lou
Yuanhao Liu
Changhyun Choi
333
20
0
04 Jan 2025
ThinkGrasp: A Vision-Language System for Strategic Part Grasping in
  Clutter
ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter
Yaoyao Qian
Xu Zhu
Ondrej Biza
Shuo Jiang
Linfeng Zhao
Hao-zhe Huang
Yu Qi
Robert Platt
201
30
0
16 Jul 2024
Language-driven Grasp Detection
Language-driven Grasp Detection
An Dinh Vuong
Minh Nhat Vu
Baoru Huang
Nghia Nguyen
Hieu Le
T. Vo
Anh Nguyen
VLM
297
29
0
13 Jun 2024
ManipVQA: Injecting Robotic Affordance and Physically Grounded
  Information into Multi-Modal Large Language Models
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Siyuan Huang
Iaroslav Ponomarenko
Zhengkai Jiang
Xiaoqi Li
Xiaobin Hu
Shiyang Feng
Jiaming Song
Hao Dong
LM&Ro
294
38
0
17 Mar 2024
Reasoning Grasping via Multimodal Large Language Model
Reasoning Grasping via Multimodal Large Language ModelConference on Robot Learning (CoRL), 2024
Shiyu Jin
Jinxuan Xu
Yutian Lei
Liangjun Zhang
LRM
222
35
0
09 Feb 2024
Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in
  Clutter
Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in ClutterConference on Robot Learning (CoRL), 2023
Georgios Tziafas
Yucheng Xu
Arushi Goel
Mohammadreza Kasaei
Zhibin Li
Hamidreza Kasaei
187
37
0
09 Nov 2023
Adversarial Object Rearrangement in Constrained Environments with
  Heterogeneous Graph Neural Networks
Adversarial Object Rearrangement in Constrained Environments with Heterogeneous Graph Neural NetworksIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Xibai Lou
Houjian Yu
Ross Worobel
Yang Yang
Changhyun Choi
191
7
0
27 Sep 2023
Beyond One-to-One: Rethinking the Referring Image Segmentation
Beyond One-to-One: Rethinking the Referring Image SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Yutao Hu
Qixiong Wang
Wenqi Shao
Enze Xie
Zhenguo Li
Jungong Han
Ping Luo
3DV
195
63
0
26 Aug 2023
EAVL: Explicitly Align Vision and Language for Referring Image
  Segmentation
EAVL: Explicitly Align Vision and Language for Referring Image Segmentation
Yimin Yan
Xingjian He
Wenxuan Wang
Sihan Chen
Qingbin Liu
ObjDVLM
254
2
0
18 Aug 2023
IOSG: Image-driven Object Searching and Grasping
IOSG: Image-driven Object Searching and GraspingIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Houjian Yu
Xibai Lou
Yang Yang
Changhyun Choi
117
7
0
10 Aug 2023
VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects
  in Cluttered Indoor Scenes
VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor ScenesIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Yuhao Lu
Yixuan Fan
Beixing Deng
Fan Liu
Yali Li
Shengjin Wang
222
55
0
01 Aug 2023
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for
  Referring Image Segmentation
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Zunnan Xu
Zhihong Chen
Yong Zhang
Yibing Song
Xiang Wan
Guanbin Li
VLM
184
68
0
21 Jul 2023
DINOv2: Learning Robust Visual Features without Supervision
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Edouard Grave
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLMCLIPSSL
1.0K
5,722
0
14 Apr 2023
Task-Oriented Grasp Prediction with Visual-Language Inputs
Task-Oriented Grasp Prediction with Visual-Language InputsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Chao Tang
Dehao Huang
Lingxiao Meng
Weiyu Liu
Kuanqi Cai
177
44
0
28 Feb 2023
A Joint Modeling of Vision-Language-Action for Target-oriented Grasping
  in Clutter
A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in ClutterIEEE International Conference on Robotics and Automation (ICRA), 2023
Kechun Xu
Shuqing Zhao
Zhongxiang Zhou
Zizhang Li
Huaijin Pi
Yifeng Zhu
Yue Wang
R. Xiong
198
62
0
24 Feb 2023
Position-Aware Contrastive Alignment for Referring Image Segmentation
Position-Aware Contrastive Alignment for Referring Image Segmentation
Bo Chen
Zhiwei Hu
Zhilong Ji
Jinfeng Bai
W. Zuo
199
9
0
27 Dec 2022
Self-Supervised Interactive Object Segmentation Through a
  Singulation-and-Grasping Approach
Self-Supervised Interactive Object Segmentation Through a Singulation-and-Grasping ApproachEuropean Conference on Computer Vision (ECCV), 2022
Houjian Yu
Changhyun Choi
215
15
0
19 Jul 2022
Convolutional Bypasses Are Better Vision Transformer Adapters
Convolutional Bypasses Are Better Vision Transformer AdaptersEuropean Conference on Artificial Intelligence (ECAI), 2022
Shibo Jie
Zhi-Hong Deng
VPVLM
229
156
0
14 Jul 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual
  Recognition
AdaptFormer: Adapting Vision Transformers for Scalable Visual RecognitionNeural Information Processing Systems (NeurIPS), 2022
Shoufa Chen
Chongjian Ge
Zhan Tong
Jiangliu Wang
Yibing Song
Jue Wang
Ping Luo
478
911
0
26 May 2022
Learning 6-DoF Object Poses to Grasp Category-level Objects by Language
  Instructions
Learning 6-DoF Object Poses to Grasp Category-level Objects by Language InstructionsIEEE International Conference on Robotics and Automation (ICRA), 2022
Chi-Hou Cheang
Haitao Lin
Yanwei Fu
Xiangyang Xue
159
27
0
09 May 2022
ReSTR: Convolution-free Referring Image Segmentation Using Transformers
ReSTR: Convolution-free Referring Image Segmentation Using TransformersComputer Vision and Pattern Recognition (CVPR), 2022
N. Kim
Dongwon Kim
Cuiling Lan
Wenjun Zeng
Suha Kwak
287
175
0
31 Mar 2022
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
LAVT: Language-Aware Vision Transformer for Referring Image SegmentationComputer Vision and Pattern Recognition (CVPR), 2021
Zhao Yang
Yuan Liu
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Juil Sock
665
410
0
04 Dec 2021
CRIS: CLIP-Driven Referring Image Segmentation
CRIS: CLIP-Driven Referring Image Segmentation
Zhaoqing Wang
Yu Lu
Qiang Li
Xunqiang Tao
Yan Guo
Ming Gong
Tongliang Liu
VLM
385
438
0
30 Nov 2021
CLIPort: What and Where Pathways for Robotic Manipulation
CLIPort: What and Where Pathways for Robotic ManipulationConference on Robot Learning (CoRL), 2021
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
270
800
0
24 Sep 2021
Learning to Prompt for Vision-Language Models
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLMCLIPVLM
1.1K
3,229
0
02 Sep 2021
Vision-Language Transformer and Query Generation for Referring
  Segmentation
Vision-Language Transformer and Query Generation for Referring SegmentationIEEE International Conference on Computer Vision (ICCV), 2021
Henghui Ding
Chang-rui Liu
Suchen Wang
Xudong Jiang
237
321
0
12 Aug 2021
LoRA: Low-Rank Adaptation of Large Language Models
LoRA: Low-Rank Adaptation of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRLAI4TSAI4CEALMAIMat
1.5K
14,789
0
17 Jun 2021
Attribute-Based Robotic Grasping with One-Grasp Adaptation
Attribute-Based Robotic Grasping with One-Grasp AdaptationIEEE International Conference on Robotics and Automation (ICRA), 2021
Yang Yang
Yuanhao Liu
Hengyue Liang
Xibai Lou
Changhyun Choi
117
18
0
06 Apr 2021
Collision-Aware Target-Driven Object Grasping in Constrained
  Environments
Collision-Aware Target-Driven Object Grasping in Constrained EnvironmentsIEEE International Conference on Robotics and Automation (ICRA), 2021
Xibai Lou
Yang Yang
Changhyun Choi
166
36
0
01 Apr 2021
Efficient learning of goal-oriented push-grasping synergy in clutter
Efficient learning of goal-oriented push-grasping synergy in clutterIEEE Robotics and Automation Letters (RA-L), 2021
Kechun Xu
Hongxiang Yu
Qianen Lai
Yue Wang
R. Xiong
231
85
0
09 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language SupervisionInternational Conference on Machine Learning (ICML), 2021
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
2.0K
39,913
0
26 Feb 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Prefix-Tuning: Optimizing Continuous Prompts for GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Xiang Lisa Li
Abigail Z. Jacobs
642
5,102
0
01 Jan 2021
Visually Grounding Language Instruction for History-Dependent
  Manipulation
Visually Grounding Language Instruction for History-Dependent ManipulationIEEE International Conference on Robotics and Automation (ICRA), 2020
Hyemin Ahn
Obin Kwon
Kyungdo Kim
Jaeyeon Jeong
Howoong Jun
Hongjung Lee
Dongheui Lee
Songhwai Oh
LM&Ro
203
7
0
16 Dec 2020
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningComputer Vision and Pattern Recognition (CVPR), 2020
Xinlei Chen
Kaiming He
SSL
668
4,618
0
20 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
1.3K
53,494
0
22 Oct 2020
Unseen Object Instance Segmentation for Robotic Environments
Unseen Object Instance Segmentation for Robotic EnvironmentsIEEE Transactions on robotics (IEEE Trans. Robot.), 2020
Christopher Xie
Yu Xiang
Arsalan Mousavian
Dieter Fox
285
146
0
16 Jul 2020
Mechanical Search: Multi-Step Retrieval of a Target Object Occluded by
  Clutter
Mechanical Search: Multi-Step Retrieval of a Target Object Occluded by ClutterIEEE International Conference on Robotics and Automation (ICRA), 2019
Michael Danielczuk
Andrey Kurenkov
Ashwin Balakrishna
Matthew Matl
David Wang
Roberto Martín-Martín
Animesh Garg
Silvio Savarese
Ken Goldberg
135
119
0
04 Mar 2019
Parameter-Efficient Transfer Learning for NLP
Parameter-Efficient Transfer Learning for NLPInternational Conference on Machine Learning (ICML), 2019
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
569
5,518
0
02 Feb 2019
Jacquard: A Large Scale Dataset for Robotic Grasp Detection
Jacquard: A Large Scale Dataset for Robotic Grasp Detection
Amaury Depierre
Emmanuel Dellandrea
Liming Chen
258
363
0
30 Mar 2018
MAttNet: Modular Attention Network for Referring Expression
  Comprehension
MAttNet: Modular Attention Network for Referring Expression Comprehension
Licheng Yu
Zhe Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Joey Tianyi Zhou
Tamara L. Berg
ObjD
389
905
0
24 Jan 2018
Attention Is All You Need
Attention Is All You NeedNeural Information Processing Systems (NeurIPS), 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
2.5K
157,684
0
12 Jun 2017
Overcoming catastrophic forgetting in neural networks
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
989
8,666
0
02 Dec 2016
Modeling Context Between Objects for Referring Expression Understanding
Modeling Context Between Objects for Referring Expression Understanding
Varun K. Nagaraja
Vlad I. Morariu
Larry S. Davis
237
224
0
01 Aug 2016
Segmentation from Natural Language Expressions
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLMEgoV
199
501
0
20 Mar 2016
12
Next