2306.14289
Cited By
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
25 June 2023
Chaoning Zhang
Dongshen Han
Yu Qiao
Jung Uk Kim
Sung-Ho Bae
Seungkyu Lee
Choong Seon Hong
VLM
HuggingFace (15 upvotes)
Github (5194★)
Papers citing "Faster Segment Anything: Towards Lightweight SAM for Mobile Applications"
50 / 264 papers shown
Title
SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation
Jia Wang
Yunan Mei
Jiarui Liu
Xin Fan
169
0
0
29 Apr 2025
Multimodal Perception for Goal-oriented Navigation: A Survey
I-Tak Ieong
Hao Tang
LM&Ro
LRM
281
0
0
22 Apr 2025
ApexNav: An Adaptive Exploration Strategy for Zero-Shot Object Navigation with Target-centric Semantic Fusion
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Mingjie Zhang
Yuheng Du
Chengkai Wu
Jinni Zhou
Zhenchao Qi
Jun Ma
Boyu Zhou
528
5
0
20 Apr 2025
SAM-Based Building Change Detection with Distribution-Aware Fourier Adaptation and Edge-Constrained Warping
Yun-Cheng Li
Sen Lei
Yi Zhao
Heng-Chao Li
Jun Li
Antonio J. Plaza
216
2
0
17 Apr 2025
Long Range Navigator (LRN): Extending robot planning horizons beyond metric maps
Matt Schmittle
Rohan Baijal
Nathan Hatch
Rosario Scalise
Mateo Guaman Castro
Sidharth Talia
Khimya Khetarpal
Byron Boots
S. Srinivasa
172
3
0
17 Apr 2025
CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model
Ruohao Zhan
Yijin Li
Yisheng He
Shuo Chen
Yichen Shen
Xinyu Chen
Zilong Dong
Zhaoyang Huang
Guofeng Zhang
DiffM
252
1
0
11 Apr 2025
Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction
Junyi Ma
Wentao Bao
Jingyi Xu
Guanzhong Sun
Xieyuanli Chen
Hesheng Wang
208
4
0
10 Apr 2025
IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments
Computer Vision and Pattern Recognition (CVPR), 2025
Can Zhang
G. Lee
210
4
0
09 Apr 2025
HER-Seg: Holistically Efficient Segmentation for High-Resolution Medical Images
Qing Xu
Zhenye Lou
Chenxin Li
Xiangjian He
Rong Qu
Tesema Fiseha Berhanu
Yi Wang
Wenting Duan
Daming Gao
MedIm
193
0
0
08 Apr 2025
InvNeRF-Seg: Fine-Tuning a Pre-Trained NeRF for 3D Object Segmentation
Jiangsan Zhao
Jakob Geipel
Krzysztof Kusnierek
Xuean Cui
3DPC
165
1
0
08 Apr 2025
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders
Ilan Naiman
Emanuel Ben-Baruch
Oron Anschel
Alon Shoshan
Igor Kviatkovsky
Manoj Aggarwal
Gérard Medioni
253
1
0
04 Apr 2025
Semi-SMD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving
Yusen Xie
Zhengmin Huang
Shaojie Shen
Jun Ma
360
1
0
25 Mar 2025
MapGlue: Multimodal Remote Sensing Image Matching
Peihao Wu
Yongxiang Yao
Wenfei Zhang
Dong Wei
Qiong Wu
Yansheng Li
Yongjun Zhang
207
5
0
20 Mar 2025
RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation
Yuwen Du
Anning Hu
Zichen Chao
Yifan Lu
Junhao Ge
Genjia Liu
Weitao Wu
Lanjun Wang
Siheng Chen
374
0
0
13 Mar 2025
Visual and Text Prompt Segmentation: A Novel Multi-Model Framework for Remote Sensing
Xing Zi
Kairui Jin
Xian Tao
Jun Li
Ali Braytee
Rajiv Ratn Shah
Mukesh Prasad
VLM
164
1
0
10 Mar 2025
RS2-SAM2: Customized SAM2 for Referring Remote Sensing Image Segmentation
Fu Rong
Meng Lan
Qian Zhang
Guang Dai
409
1
0
10 Mar 2025
SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model
Jing Zhang
Zhiyu Li
Chengzhi Hu
Xuewen Liu
Qingyi Gu
VLM
MQ
197
0
0
09 Mar 2025
Bayesian Fields: Task-driven Open-Set Semantic Gaussian Splatting
Dominic Maggio
Luca Carlone
818
1
0
07 Mar 2025
SAS: Segment Anything Small for Ultrasound -- A Non-Generative Data Augmentation Technique for Robust Deep Learning in Ultrasound Imaging
Danielle L. Ferreira
Ahana Gangopadhyay
Hsi-Ming Chang
Ravi Soni
Gopal Avinash
228
0
0
07 Mar 2025
A Token-level Text Image Foundation Model for Document Understanding
Tongkun Guan
Zining Wang
Pei Fu
Zhengtao Guo
Wei Shen
...
Chen Duan
Hao Sun
Qianyi Jiang
Junfeng Luo
Yunbo Wang
VLM
482
4
0
04 Mar 2025
Language-Guided Object Search in Agricultural Environments
IEEE International Conference on Robotics and Automation (ICRA), 2025
Advaith Balaji
Saket Pradhan
Dmitry Berenson
LM&Ro
266
1
0
03 Mar 2025
OVAMOS: A Framework for Open-Vocabulary Multi-Object Search in Unknown Environments
Qianwei Wang
Yifan Xu
V. Kamat
Carol Menassa
256
2
0
03 Mar 2025
Vision Foundation Models in Medical Image Analysis: Advances and Challenges
Pengchen Liang
Bin Pu
Haishan Huang
Yiwei Li
Jian Shu
Weibo Ma
Qing Chang
VLM
MedIm
335
4
0
24 Feb 2025
ZeroPS: High-quality Cross-modal Knowledge Transfer for Zero-Shot 3D Part Segmentation
International Conference on 3D Vision (3DV), 2023
Yuheng Xue
Nenglun Chen
Jun Liu
Wenyun Sun
3DPC
522
13
0
24 Feb 2025
OpenBench: A New Benchmark and Baseline for Semantic Navigation in Smart Logistics
IEEE International Conference on Robotics and Automation (ICRA), 2025
Junhui Wang
Dongjie Huo
Zehui Xu
Yongliang Shi
Yimin Yan
Yuanxin Wang
Chao Gao
Yan Qiao
Guyue Zhou
373
3
0
13 Feb 2025
ImitDiff: Transferring Foundation-Model Priors for Distraction Robust Visuomotor Policy
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Yuhang Dong
Haizhou Ge
Yupei Zeng
Jing Zhang
Beiwen Tian
...
Ruixiang Wang
Ran Yi
Longhua Ma
271
1
0
11 Feb 2025
Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models
Tongkun Liu
Bing Li
Xiao Jin
Yupeng Shi
Qiuying Li
Xiang Wei
385
2
0
03 Feb 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Qian Zhang
Guang Dai
VOS
VGen
534
3
0
23 Jan 2025
DynamicEarth: How Far are We from Open-Vocabulary Change Detection?
Kaiyu Li
Xiangyong Cao
Yupeng Deng
Chao Pang
Zepeng Xin
Deyu Meng
Zhi Wang
ObjD
290
7
0
22 Jan 2025
Challenge Summary U-MedSAM: Uncertainty-aware MedSAM for Medical Image Segmentation
Xin Wang
Xiaoyu Liu
Peng Huang
Pu Huang
Shu Hu
Hongtu Zhu
UQCV
357
0
0
20 Jan 2025
SkipClick: Combining Quick Responses and Low-Level Features for Interactive Segmentation in Winter Sports Contexts
Robin Schon
Julian Lorenz
Daniel Kienzle
Rainer Lienhart
194
1
0
14 Jan 2025
EdgeTAM: On-Device Track Anything Model
Computer Vision and Pattern Recognition (CVPR), 2025
Chong Zhou
Chenchen Zhu
Yunyang Xiong
Saksham Suri
Fanyi Xiao
...
Raghuraman Krishnamoorthi
Bo Dai
Chen Change Loy
Vikas Chandra
Bilge Soran
VLM
284
8
0
13 Jan 2025
From Simple to Complex Skills: The Case of In-Hand Object Reorientation
IEEE International Conference on Robotics and Automation (ICRA), 2025
Haozhi Qi
Brent Yi
Mike Lambeta
Yi-An Ma
Roberto Calandra
Jitendra Malik
173
10
0
10 Jan 2025
Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis
AAAI Conference on Artificial Intelligence (AAAI), 2025
Yiliang Chen
Steven SC Ho
Cheng Xu
Yao Jie Xie
Wing-Fai Yeung
Shengfeng He
Jing Qin
LM&MA
280
1
0
06 Jan 2025
Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment
Haisheng Lu
Yujie Fu
Fan Zhang
Le Zhang
MedIm
MQ
210
1
0
15 Dec 2024
Rethinking Detecting Salient and Camouflaged Objects in Unconstrained Scenes
Zhangjun Zhou
Yiping Li
Chunlin Zhong
Jianuo Huang
Jialun Pei
He Tang
418
0
0
14 Dec 2024
Towards Real-Time Open-Vocabulary Video Instance Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Bin Yan
Martin Sundermeyer
D. Tan
Huchuan Lu
F. Tombari
VLM
VOS
264
3
0
05 Dec 2024
The Bare Necessities: Designing Simple, Effective Open-Vocabulary Scene Graphs
Christina Kassab
Matías Mattamala
Sacha Morin
Martin Buchner
Abhinav Valada
Liam Paull
Maurice F. Fallon
253
9
0
02 Dec 2024
Rapid Distributed Fine-tuning of a Segmentation Model Onboard Satellites
International Conference on Image Processing, Applications and Systems (ICIPAS), 2024
Meghan Plumridge
Rasmus Maråk
Chiara Ceccobello
Pablo Gómez
Gabriele Meoni
F. Svoboda
Nicholas D. Lane
261
0
0
26 Nov 2024
g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks
Computer Vision and Pattern Recognition (CVPR), 2024
Zihan Wang
Gim Hee Lee
228
7
0
26 Nov 2024
CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation
Dengke Zhang
Fagui Liu
Quan Tang
VLM
576
4
0
15 Nov 2024
Watermark Anything with Localized Messages
International Conference on Learning Representations (ICLR), 2024
Tom Sander
Pierre Fernandez
Alain Durmus
Teddy Furon
Matthijs Douze
VLM
398
31
0
11 Nov 2024
AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution
European Conference on Computer Vision (ECCV), 2024
Yuanting Fan
Chengxu Liu
Nengzhong Yin
Changlong Gao
Xueming Qian
165
8
0
23 Oct 2024
PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model
Zhongchen Deng
Zhechen Yang
Chi Chen
Cheng Zeng
Yan Meng
Bisheng Yang
177
3
0
21 Oct 2024
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
International Conference on Learning Representations (ICLR), 2024
Yuxiang Lu
Shengcao Cao
Yu-Xiong Wang
463
5
0
18 Oct 2024
BlabberSeg: Real-Time Embedded Open-Vocabulary Aerial Segmentation
Haechan Mark Bong
Ricardo de Azambuja
Giovanni Beltrame
VLM
149
1
0
16 Oct 2024
Order-aware Interactive Segmentation
International Conference on Learning Representations (ICLR), 2024
Bin Wang
Anwesa Choudhuri
Meng Zheng
Zhongpai Gao
Benjamin Planche
Andong Deng
Qin Liu
Terrence Chen
Ulas Bagci
Ziyan Wu
VLM
904
2
0
16 Oct 2024
RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation
Neural Information Processing Systems (NeurIPS), 2024
Anton Antonov
Andrey Moskalenko
Denis Shepelev
Alexander Krapukhin
Konstantin Soshin
Anton Konushin
V. Shakhuro
269
1
0
15 Oct 2024
Real-Time Localization and Bimodal Point Pattern Analysis of Palms Using UAV Imagery
Kangning Cui
Wei Tang
Rongkun Zhu
Manqi Wang
Gregory Larsen
...
Jordan Karubian
Raymond H. Chan
R. Plemmons
Jean-Michel Morel
Miles Silman
148
7
0
14 Oct 2024
Words to Wheels: Vision-Based Autonomous Driving Understanding Human Language Instructions Using Foundation Models
Chanhoe Ryu
Hyunki Seong
Daegyu Lee
Seongwoo Moon
Sungjae Min
David Hyunchul Shim
190
2
0
14 Oct 2024