Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2306.14289
Cited By
v1
v2 (latest)
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
25 June 2023
Chaoning Zhang
Dongshen Han
Yu Qiao
Jung Uk Kim
Sung-Ho Bae
Seungkyu Lee
Choong Seon Hong
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (15 upvotes)
Github (5194★)
Papers citing
"Faster Segment Anything: Towards Lightweight SAM for Mobile Applications"
50 / 264 papers shown
Title
EasyHeC++: Fully Automatic Hand-Eye Calibration with Pretrained Image Models
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Zhengdong Hong
Kangfu Zheng
Linghao Chen
VLM
160
8
0
11 Oct 2024
Zero-Shot Pupil Segmentation with SAM 2: A Case Study of Over 14 Million Images
Proceedings of the ACM on Computer Graphics and Interactive Techniques (CGIT), 2024
Virmarie Maquiling
Sean Anthony Byrne
D. Niehorster
Marco Carminati
Enkelejda Kasneci
VLM
290
2
0
11 Oct 2024
Thing2Reality: Transforming 2D Content into Conditioned Multiviews and 3D Gaussian Objects for XR Communication
Erzhen Hu
Mingyi Li
Jungtaek Hong
Xun Qian
A. Olwal
David Kim
Seongkook Heo
Ruofei Du
123
2
0
09 Oct 2024
Iterative Optimization Annotation Pipeline and ALSS-YOLO-Seg for Efficient Banana Plantation Segmentation in UAV Imagery
Frontiers in Plant Science (Front. Plant Sci.), 2024
Ang He
Ximei Wu
Xing Xu
Jing Chen
Xiaobin Guo
Sheng Xu
147
5
0
09 Oct 2024
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Neural Information Processing Systems (NeurIPS), 2024
Zhiwei Lin
Yongtao Wang
Zhi Tang
ObjD
VLM
180
14
0
08 Oct 2024
Anticipating Human Behavior for Safe Navigation and Efficient Collaborative Manipulation with Mobile Service Robots
S. Bultmann
Raphael Memmesheimer
Jan Nogga
Julian Hau
Sven Behnke
253
0
0
07 Oct 2024
On Efficient Variants of Segment Anything Model: A Survey
International Journal of Computer Vision (IJCV), 2024
Xiaorui Sun
Jing Liu
Jikang Cheng
Xiaofeng Zhu
Ping Hu
VLM
445
18
0
07 Oct 2024
Helpful DoggyBot: Open-World Object Fetching using Legged Robots and Vision-Language Models
Qi Wu
Zipeng Fu
Xuxin Cheng
Xiaolong Wang
Chelsea Finn
LM&Ro
248
14
0
30 Sep 2024
Playful DoggyBot: Learning Agile and Precise Quadrupedal Locomotion
Xin Duan
Ziwen Zhuang
Hang Zhao
Soeren Schwertfeger
351
2
0
30 Sep 2024
DarkSAM: Fooling Segment Anything Model to Segment Nothing
Neural Information Processing Systems (NeurIPS), 2024
Ziqi Zhou
Yufei Song
Minghui Li
Shengshan Hu
Xianlong Wang
Leo Yu Zhang
Dezhong Yao
Hai Jin
240
27
0
26 Sep 2024
Global-Local Medical SAM Adaptor Based on Full Adaption
Meng Wang
Yarong Feng
Yongwei Tang
Tian Zhang
Yuxin Liang
Chao Lv
MedIm
212
1
0
26 Sep 2024
SAMEdge: An Edge-cloud Video Analytics Architecture for the Segment Anything Model
Rui Lu
Siping Shi
Yanting Liu
Dan Wang
VLM
132
2
0
23 Sep 2024
A Modular Robotic System for Autonomous Exploration and Semantic Updating in Large-Scale Indoor Environments
Sai Haneesh Allu
Itay Kadosh
Tyler Summers
Yu Xiang
390
0
0
23 Sep 2024
PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Nanqing Liu
Xun Xu
Yongyi Su
Haojie Zhang
Heng-Chao Li
VLM
462
48
0
20 Sep 2024
Automatic Behavior Tree Expansion with LLMs for Robotic Manipulation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Jonathan Styrud
Matteo Iovino
M. Norrlöf
Mårten Björkman
Christian Smith
LLMAG
164
12
0
20 Sep 2024
GraspSAM: When Segment Anything Model Meets Grasp Detection
IEEE International Conference on Robotics and Automation (ICRA), 2024
Sangjun Noh
Jongwon Kim
Dongwoo Nam
Seunghyeok Back
Raeyoung Kang
Kyoobin Lee
VLM
318
10
0
19 Sep 2024
Vision Language Models Can Parse Floor Plan Maps
David DeFazio
Hrudayangam Mehta
Meng Wang
Ping Yang
Jeremy Blackburn
Shiqi Zhang
CoGe
287
4
0
19 Sep 2024
Privacy-Preserving SAM Quantization for Efficient Edge Intelligence in Healthcare
Zhikai Li
Jing Zhang
Qingyi Gu
MedIm
239
3
0
14 Sep 2024
Swin-LiteMedSAM: A Lightweight Box-Based Segment Anything Model for Large-Scale Medical Image Datasets
Ruochen Gao
Donghang Lyu
Marius Staring
VLM
MedIm
89
7
0
11 Sep 2024
Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts
Assefa Seyoum Wahd
B. Felfeliyan
Yuyue Zhou
Shrimanti Ghosh
Adam McArthur
Jiechen Zhang
Jacob L. Jaremko
A. Hareendranathan
VLM
MedIm
199
6
0
10 Sep 2024
Towards Generalizable Scene Change Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Jaewoo Kim
Uehwan Kim
468
7
0
10 Sep 2024
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation
Xi Chen
Haosen Yang
Sheng Jin
Xiatian Zhu
Huanjin Yao
VLM
215
5
0
05 Sep 2024
ReMoBot: Retrieval-Based Few-Shot Imitation Learning for Mobile Manipulation with Vision Foundation Models
Yuying Zhang
Wenyan Yang
Francesco Verdoja
Ville Kyrki
Joni Pajarinen
451
3
0
28 Aug 2024
OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation
Muhammad Rameez Ur Rahman
Piero Simonetto
Anna Polato
Francesco Pasti
Luca Tonin
Sebastiano Vascon
3DPC
145
1
0
25 Aug 2024
Target-Oriented Object Grasping via Multimodal Human Guidance
Pengwei Xie
Siang Chen
Dingchang Hu
Yixiang Dai
Kaiqin Yang
Guijin Wang
248
5
0
20 Aug 2024
Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving
Jun Yan
Pengyu Wang
Danni Wang
Weiquan Huang
Daniel Watzenig
Huilin Yin
AAML
VLM
185
6
0
19 Aug 2024
Segment Anything with Multiple Modalities
Aoran Xiao
Weihao Xuan
Heli Qi
Yun Xing
Xiangwei Zhu
Shijian Lu
VLM
271
11
0
17 Aug 2024
A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot
Haoxuan Ding
Qi. Wang
Junyu Gao
Qiang Li
VLM
201
2
0
11 Aug 2024
One Shot is Enough for Sequential Infrared Small Target Segmentation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Bingbing Dan
Meihui Li
Tao Tang
Jing Zhang
158
0
0
09 Aug 2024
SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More
Tianrun Chen
Ankang Lu
Lanyun Zhu
Chaotao Ding
Chunan Yu
Deyi Ji
Ziyue Li
Lingyun Sun
Papa Mao
Ying Zang
VLM
MedIm
223
57
0
08 Aug 2024
Strike the Balance: On-the-Fly Uncertainty based User Interactions for Long-Term Video Object Segmentation
Asian Conference on Computer Vision (ACCV), 2024
Stéphane Vujasinović
Kamil Dreczkowski
Sebastian Bullinger
Norbert Scherer-Negenborn
Michael Arens
VOS
254
1
0
31 Jul 2024
The Collection of a Human Robot Collaboration Dataset for Cooperative Assembly in Glovebox Environments
Shivansh Sharma
Mathew Huang
Sanat Nair
Alan Wen
Christina Petlowany
Juston Moore
Selma Wanna
Mitch Pryor
298
2
0
19 Jul 2024
De-LightSAM: Modality-Decoupled Lightweight SAM for Generalizable Medical Segmentation
Qing Xu
Jiaxuan Li
Xiangjian He
Ziyu Liu
Daming Gao
Wenting Duan
Chenxin Li
Maggie M. He
Fiseha B. Tesema
W. Cheah
MedIm
VLM
389
0
0
19 Jul 2024
VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentation
Zhen Qu
Xian Tao
Mukesh Prasad
Fei Shen
Zhengtao Zhang
Xinyi Gong
Guiguang Ding
VLM
240
48
0
17 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
VGen
DiffM
491
28
0
17 Jul 2024
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes
Zhi Cai
Yingjie Gao
Yaoyan Zheng
Nan Zhou
Di Huang
VLM
319
13
0
16 Jul 2024
Lite-SAM Is Actually What You Need for Segment Everything
Jianhai Fu
Yuanjie Yu
Ningchuan Li
Yi Zhang
Qichao Chen
Jianping Xiong
Jun Yin
Zhiyu Xiang
VLM
213
5
0
12 Jul 2024
AutoMate: Specialist and Generalist Assembly Policies over Diverse Geometries
Bingjie Tang
Iretiayo Akinola
Jie Xu
Bowen Wen
Ankur Handa
Karl Van Wyk
Dieter Fox
Gaurav Sukhatme
Fabio Ramos
Yashraj S. Narang
395
29
0
10 Jul 2024
IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection
Mingjin Zhang
Yuchun Wang
Jie-Ru Guo
Yunsong Li
Xinbo Gao
Jing Zhang
VLM
176
100
0
10 Jul 2024
Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Sriram Yenamandra
Arun Ramachandran
Mukul Khanna
Karmesh Yadav
Jay Vakil
...
Z. Kira
Dhruv Batra
Roozbeh Mottaghi
Yonatan Bisk
Chris Paxton
LM&Ro
257
9
0
09 Jul 2024
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models
Ruinan Jin
Zikang Xu
Yuan Zhong
Qiongsong Yao
Qi Dou
S. Kevin Zhou
Xiaoxiao Li
VLM
353
39
0
01 Jul 2024
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Yuxuan Zhang
Tianheng Cheng
Lianghui Zhu
Lei Liu
Heng Liu
Longjin Ran
Xiaoxin Chen
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
VLM
539
53
0
28 Jun 2024
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Haobo Yuan
Xiangtai Li
Lu Qi
Tao Zhang
Ming-Hsuan Yang
Shuicheng Yan
Chen Change Loy
VLM
269
20
0
27 Jun 2024
XAMI -- A Benchmark Dataset for Artefact Detection in XMM-Newton Optical Images
Elisabeta-Iulia Dima
Pablo Gómez
Sandor Kruk
Peter Kretschmar
Simon Rosen
Călin-Adrian Popa
165
0
0
25 Jun 2024
TraceNet: Segment one thing efficiently
Mingyuan Wu
Zichuan Liu
Haozhen Zheng
Hongpeng Guo
Bo Chen
Xin Lu
Klara Nahrstedt
285
0
0
21 Jun 2024
CityNav: A Large-Scale Dataset for Real-World Aerial Navigation
Jungdae Lee
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
Daichi Azuma
Yutaka Matsuo
Nakamasa Inoue
285
23
0
20 Jun 2024
Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation
Diwei Sheng
Giles Hamilton-Fletcher
Mahya Beheshti
Chen Feng
John-Ross Rizzo
194
3
0
11 Jun 2024
Open-Vocabulary Part-Based Grasping
Tjeard van Oort
Dimity Miller
Will N. Browne
Nicolas Marticorena
Jesse Haviland
Niko Suenderhauf
3DPC
187
4
0
10 Jun 2024
Balancing Performance and Efficiency in Zero-shot Robotic Navigation
Dmytro Kuzmenko
N. Shvai
LM&Ro
207
0
0
05 Jun 2024
Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy
Yunho Kim
Jeong Hyun Lee
Choongin Lee
Juhyeok Mun
D. Youm
Jeongsoo Park
Jemin Hwangbo
163
8
0
05 Jun 2024
Previous
1
2
3
4
5
6
Next