2306.14289
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
25 June 2023
Chaoning Zhang
Dongshen Han
Yu Qiao
Jung Uk Kim
Sung-Ho Bae
Seungkyu Lee
Choong Seon Hong
VLM
ArXiv (abs)
PDF
HTML
HuggingFace (15 upvotes)
Github (5194★)
Papers citing
"Faster Segment Anything: Towards Lightweight SAM for Mobile Applications"
50 / 264 papers shown
Title
Open-Set Domain Adaptation for Semantic Segmentation
Seun-An Choe
Ah-Hyung Shin
Keon-Hee Park
Jinwoo Choi
Gyeong-Moon Park
184
16
0
30 May 2024
Aligning in a Compact Space: Contrastive Knowledge Distillation between Heterogeneous Architectures
Hongjun Wu
Li Xiao
Xingkuo Zhang
Yining Miao
268
2
0
28 May 2024
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
Qihang Zhang
Yinghao Xu
Chaoyang Wang
Hsin-Ying Lee
Gordon Wetzstein
Bolei Zhou
Ceyuan Yang
3DGS
198
18
0
28 May 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
724
11
0
28 May 2024
PLUG: Revisiting Amodal Segmentation with Foundation Model and Hierarchical Focus
Zhaochen Liu
Limeng Qiao
Xiangxiang Chu
Tingting Jiang
230
2
0
25 May 2024
Tuning-free Universally-Supervised Semantic Segmentation
IEEE Access (IEEE Access), 2024
Xiaobo Yang
Xiaojin Gong
VLM
236
3
0
23 May 2024
MetaFruit Meets Foundation Models: Leveraging a Comprehensive Multi-Fruit Dataset for Advancing Agricultural Foundation Models
Jiajia Li
Kyle Lammers
Xunyuan Yin
Xiang Yin
Long He
Renfu Lu
Zhaojian Li
156
6
0
14 May 2024
PTQ4SAM: Post-Training Quantization for Segment Anything
Computer Vision and Pattern Recognition (CVPR), 2024
Chengtao Lv
Hong Chen
Jinyang Guo
Yifu Ding
Xianglong Liu
VLM
MQ
165
29
0
06 May 2024
M²Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation
European Conference on Computer Vision (ECCV), 2024
Yingshuang Zou
Yikang Ding
Xi Qiu
Haoqian Wang
Haotian Zhang
MDE
213
7
0
03 May 2024
Innovative Integration of Visual Foundation Model with a Robotic Arm on a Mobile Platform
Shimian Zhang
Qiuhong Lu
199
1
0
29 Apr 2024
Online, Target-Free LiDAR-Camera Extrinsic Calibration via Cross-Modal Mask Matching
Zhiwei Huang
Yikang Zhang
Qijun Chen
Rui Fan
295
12
0
28 Apr 2024
What Foundation Models can Bring for Robot Learning in Manipulation: A Survey
Dingzhe Li
Yixiang Jin
Yong A
Hongze Yu
...
Huaping Liu
Gang Hua
F. Sun
Jianwei Zhang
Bin Fang
AI4CE
LM&Ro
855
24
0
28 Apr 2024
Moving Object Segmentation: All You Need Is SAM (and Flow)
Junyu Xie
Charig Yang
Weidi Xie
Andrew Zisserman
335
23
0
18 Apr 2024
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model
Han Gu
Haoyu Dong
Jichen Yang
Maciej A. Mazurowski
MedIm
VLM
331
42
0
15 Apr 2024
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
Junchi Wang
Lei Ke
MLLM
LRM
VLM
215
59
0
12 Apr 2024
Practical Region-level Attack against Segment Anything Models
Yifan Shen
Zhengyuan Li
Gang Wang
VLM
179
20
0
12 Apr 2024
SAM-I-Am: Semantic Boosting for Zero-shot Atomic-Scale Electron Micrograph Segmentation
Computational materials science (Comput. Mater. Sci.), 2024
Waqwoya Abebe
Jan Strube
Luanzheng Guo
Nathan R. Tallent
Oceane Bel
Steven Spurgeon
Christina Doty
Ali Jannesari
115
7
0
09 Apr 2024
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model
Computer Vision and Pattern Recognition (CVPR), 2024
Xu He
Qiaochu Huang
Zhensong Zhang
Zhiwei Lin
Zhiyong Wu
Sicheng Yang
Minglei Li
Zhiyi Chen
Songcen Xu
Xiaofei Wu
167
27
0
02 Apr 2024
Vision-language models for decoding provider attention during neonatal resuscitation
Felipe Parodi
Jordan K Matelsky
Alejandra Regla-Vargas
Elizabeth E. Foglia
Charis Lim
Danielle Weinberg
Konrad Kording
Heidi Herrick
Michael L Platt
119
1
0
01 Apr 2024
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts
Qin Liu
Jaemin Cho
Mohit Bansal
Marc Niethammer
VLM
167
26
0
31 Mar 2024
Annolid: Annotate, Segment, and Track Anything You Need
Chen Yang
Thomas A. Cleland
VOS
137
2
0
27 Mar 2024
Segment Any Medical Model Extended
Yihao Liu
Kailai Li
Andres Diaz-Pinto
Haowei Li
A. Martin-Gomez
Amir Kheradmand
Mehran Armand
VLM
MedIm
137
8
0
26 Mar 2024
Fast Sparse View Guided NeRF Update for Object Reconfigurations
Ziqi Lu
Jianbo Ye
Xiaohan Fei
Xiaolong Li
Jiawei Mo
Ashwin Swaminathan
Stefano Soatto
245
4
0
16 Mar 2024
CSDNet: Detect Salient Object in Depth-Thermal via A Lightweight Cross Shallow and Deep Perception Network
Xiaotong Yu
Ruihan Xie
Zhihe Zhao
Chang-Wen Chen
129
0
0
15 Mar 2024
Group-Mix SAM: Lightweight Solution for Industrial Assembly Line Applications
Wu Liang
X.-G. Ma
151
0
0
15 Mar 2024
FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024
Yiqing Shen
Jingxing Li
Xinyuan Shao
Blanca Inigo Romillo
Ankush Jindal
David Dreizin
Mathias Unberath
MedIm
190
25
0
14 Mar 2024
PosSAM: Panoptic Open-vocabulary Segment Anything
VS Vibashan
Shubhankar Borse
Hyojin Park
Debasmit Das
Vishal M. Patel
Munawar Hayat
Fatih Porikli
VLM
MLLM
169
8
0
14 Mar 2024
SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration
Yanfei Song
Bangzheng Pu
Peng Wang
Hongxu Jiang
Dong Dong
Yongxiang Cao
Yiqing Shen
VLM
228
16
0
14 Mar 2024
VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework
Chris Kelly
Luhui Hu
Bang Yang
Yu Tian
Deshun Yang
Cindy Yang
Zaoshan Huang
Zihao Li
Jiayin Hu
Yuexian Zou
161
14
0
14 Mar 2024
Q-SLAM: Quadric Representations for Monocular SLAM
Conference on Robot Learning (CoRL), 2024
Chensheng Peng
Chenfeng Xu
Yue Wang
Mingyu Ding
Heng Yang
Masayoshi Tomizuka
Kurt Keutzer
Marco Pavone
Wei Zhan
179
14
0
12 Mar 2024
Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything
Healthcare technology letters (HTL), 2024
Zijian Wu
Adam Schmidt
Peter Kazanzides
Septimiu E. Salcudean
211
5
0
12 Mar 2024
SAMDA: Leveraging SAM on Few-Shot Domain Adaptation for Electronic Microscopy Segmentation
IEEE International Symposium on Biomedical Imaging (ISBI), 2024
Yiran Wang
Li Xiao
MedIm
146
7
0
12 Mar 2024
PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models
Qingdong He
Jinlong Peng
Zhengkai Jiang
Xiaobin Hu
Jiangning Zhang
3DPC
VLM
431
8
0
11 Mar 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVM
VGen
238
65
0
08 Mar 2024
SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising
Tao Zhou
Tong Lu
Qi Ye
Zhiguo Shi
Jiming Chen
VLM
204
3
0
07 Mar 2024
A SAM-guided Two-stream Lightweight Model for Anomaly Detection
Chenghao Li
Lei Qi
Xin Geng
222
16
0
29 Feb 2024
SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution
Chengcheng Wang
Zhiwei Hao
Yehui Tang
Jianyuan Guo
Yujie Yang
Kai Han
Yunhe Wang
378
17
0
27 Feb 2024
Subobject-level Image Tokenization
Delong Chen
Samuel Cahyawijaya
Jianfeng Liu
Baoyuan Wang
Pascale Fung
VLM
OCL
593
19
0
22 Feb 2024
Object-level Geometric Structure Preserving for Natural Image Stitching
Wenxiao Cai
Wankou Yang
204
10
0
20 Feb 2024
TETRIS: Towards Exploring the Robustness of Interactive Segmentation
Andrey Moskalenko
V. Shakhuro
Anna Vorontsova
Anton Konushin
Anton Antonov
Alexander Krapukhin
Denis Shepelev
Konstantin Soshin
AAML
196
2
0
09 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRL
VLM
LM&Ro
257
92
0
08 Feb 2024
EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss
Zhuoyang Zhang
Han Cai
Song Han
VLM
235
4
0
07 Feb 2024
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model
European Conference on Computer Vision (ECCV), 2024
Aoran Xiao
Weihao Xuan
Heli Qi
Yun Xing
Ruijie Ren
Xiaoqin Zhang
Ling Shao
Shijian Lu
VLM
MLLM
263
28
0
06 Feb 2024
Region-Based Representations Revisited
Michal Shlapentokh-Rothman
Ansel Blume
Yao Xiao
Yuqun Wu
TV Sethuraman
Heyi Tao
Jae Yong Lee
Wilfredo Torres
Yu-Xiong Wang
Derek Hoiem
424
14
0
04 Feb 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Maoyuan Ye
Jing Zhang
Juhua Liu
Chenyu Liu
Baocai Yin
Cong Liu
Bo Du
Dacheng Tao
VLM
190
31
0
31 Jan 2024
SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition
Xu Hu
Yuxi Wang
Lue Fan
Junsong Fan
Junran Peng
Zhen Lei
Qing Li
Zhaoxiang Zhang
3DGS
471
22
0
31 Jan 2024
MESA: Matching Everything by Segmenting Anything
Yesheng Zhang
Xu Zhao
154
15
0
30 Jan 2024
GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis
Jing Hao
Moyun Liu
Kuo Feng Hung
DiffM
124
2
0
27 Jan 2024
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Tianhe Ren
Shilong Liu
Ailing Zeng
Jing Lin
Kunchang Li
...
Feng Li
Jie Yang
Hongyang Li
Qing Jiang
Lei Zhang
VLM
335
803
0
25 Jan 2024
TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images
IEEE journal of biomedical and health informatics (IEEE JBHI), 2024
Jia Wan
Wanhua Li
Jason Ken Adhinarta
Atmadeep Banerjee
Evelina Sjostedt
Jingpeng Wu
J. Lichtman
Hanspeter Pfister
D. Wei
314
9
0
25 Jan 2024