Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.02643
Cited By
Segment Anything
5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Segment Anything"
50 / 4,189 papers shown
Title
Consistent Image Layout Editing with Diffusion Models
Tao Xia
Yudi Zhang
Ting Liu Lei Zhang
DiffM
66
1
0
09 Mar 2025
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation
Chenfei Liao
Xu Zheng
Yuanhuiyi Lyu
Haiwei Xue
Yihong Cao
Jiawen Wang
Kailun Yang
Xuming Hu
VLM
66
7
0
09 Mar 2025
Dynamic Dictionary Learning for Remote Sensing Image Segmentation
Xuechao Zou
Yue Li
Shun Zhang
Kai Li
Shiying Wang
Pin Tao
Junliang Xing
Congyan Lang
50
0
0
09 Mar 2025
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement
Yuqi Liu
Bohao Peng
Zhisheng Zhong
Zihao Yue
Fanbin Lu
Bei Yu
Jiaya Jia
LRM
VLM
55
12
0
09 Mar 2025
AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation
Yang Zou
Zhaoshuai Qi
Yating Liu
Zihao Xu
Weipeng Sun
Weiyi Liu
Xingyuan Li
Jiaqi Yang
Yanning Zhang
44
0
0
09 Mar 2025
Continuous Online Adaptation Driven by User Interaction for Medical Image Segmentation
Wentian Xu
Ziyun Liang
Harry Anthony
Yasin Ibrahim
Felix Cohen
Guang Yang
Daniel Whitehouse
David Menon
Virginia Newcombe
Konstantinos Kamnitsas
OOD
51
0
0
09 Mar 2025
Towards Universal Text-driven CT Image Segmentation
Yuheng Li
Yuxiang Lai
Maria Thor
Deborah Marshall
Zachary Buchwald
D. Yu
Xiaofeng Yang
MedIm
VLM
67
3
0
08 Mar 2025
FloPE: Flower Pose Estimation for Precision Pollination
Rashik Shrestha
Madhav Rijal
T. Smith
Yu Gu
39
0
0
08 Mar 2025
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images
Ziyue Huang
Yongchao Feng
Shuai Yang
Ziqiang Liu
Qingjie Liu
Yansen Wang
ObjD
220
0
0
08 Mar 2025
Segment Anything, Even Occluded
Wei-En Tai
Yu-Lin Shih
Cheng Sun
Y. Wang
Hwann-Tzong Chen
VLM
67
0
0
08 Mar 2025
Improving SAM for Camouflaged Object Detection via Dual Stream Adapters
Jiaming Liu
Linghe Kong
Guihai Chen
76
0
0
08 Mar 2025
GAT-Grasp: Gesture-Driven Affordance Transfer for Task-Aware Robotic Grasping
Ruixiang Wang
Huayi Zhou
Xinyue Yao
Guiliang Liu
Kui Jia
44
0
0
08 Mar 2025
From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning
Shuangzhi Li
Junlong Shen
Lei Ma
Xingyu Li
3DPC
53
0
0
08 Mar 2025
NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features
Hongjia Zhai
Boming Zhao
Hai Li
Xiaokun Pan
Yijia He
Zhaopeng Cui
Hujun Bao
Guofeng Zhang
83
2
0
08 Mar 2025
Dynamically evolving segment anything model with continuous learning for medical image segmentation
Zhaori Liu
Mengyang Li
Hu Han
Enli Zhang
Shiguang Shan
Zhiming Zhao
VLM
57
0
0
08 Mar 2025
Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
Seil Kang
Jinyeong Kim
Junhyeok Kim
Seong Jae Hwang
VLM
93
2
0
08 Mar 2025
Do Fairness Interventions Come at the Cost of Privacy: Evaluations for Binary Classifiers
Huan Tian
Guangsheng Zhang
Bo Liu
Tianqing Zhu
Ming Ding
Wanlei Zhou
55
0
0
08 Mar 2025
SplatTalk: 3D VQA with Gaussian Splatting
Anh Thai
Songyou Peng
Kyle Genova
Leonidas J. Guibas
Thomas Funkhouser
3DGS
88
0
0
08 Mar 2025
S4M: Segment Anything with 4 Extreme Points
A. Meyer
Lorenzo Arboit
Giuseppe Massimiani
Francesco Brucchi
Luca Emanuele Amodio
Didier Mutter
N. Padoy
42
0
0
07 Mar 2025
GaussianCAD: Robust Self-Supervised CAD Reconstruction from Three Orthographic Views Using 3D Gaussian Splatting
Zheng Zhou
Zhe Li
Bo Yu
Lina Hu
Liang Dong
...
Xiaoli Liu
N. Xu
Zehao Wang
Yonghao Dang
Jianqin Yin
3DGS
3DV
53
0
0
07 Mar 2025
SAS: Segment Anything Small for Ultrasound -- A Non-Generative Data Augmentation Technique for Robust Deep Learning in Ultrasound Imaging
Danielle L. Ferreira
Ahana Gangopadhyay
Hsi-Ming Chang
Ravi Soni
Gopal Avinash
45
0
0
07 Mar 2025
Bayesian Fields: Task-driven Open-Set Semantic Gaussian Splatting
Dominic Maggio
Luca Carlone
204
0
0
07 Mar 2025
DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction
Miaowei Wang
Yibo Zhang
R. Ma
Weiwei Xu
C. Zou
Daniel Morris
3DV
51
1
0
07 Mar 2025
Stereo Any Video: Temporally Consistent Stereo Matching
Junpeng Jing
Weixun Luo
Ye Mao
K. Mikolajczyk
51
0
0
07 Mar 2025
GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding
Xihan Wang
Dianyi Yang
Yu Gao
Yufeng Yue
Yi Yang
M. Fu
3DGS
54
0
0
06 Mar 2025
OPG-Policy: Occluded Push-Grasp Policy Learning with Amodal Segmentation
Hao Ding
Yiming Zeng
Zhaoliang Wan
Hui Cheng
63
1
0
06 Mar 2025
GBT-SAM: Adapting a Foundational Deep Learning Model for Generalizable Brain Tumor Segmentation via Efficient Integration of Multi-Parametric MRI Data
Cecilia Diana-Albelda
Roberto Alcover-Couso
Álvaro García-Martín
Jesús Bescós
Marcos Escudero-Viñolo
50
1
0
06 Mar 2025
WeakMedSAM: Weakly-Supervised Medical Image Segmentation via SAM with Sub-Class Exploration and Prompt Affinity Mining
Haoran Wang
Lian Huai
Wenbin Li
Lei Qi
Xingqun Jiang
Yinghuan Shi
MedIm
67
2
0
06 Mar 2025
VLA Model-Expert Collaboration for Bi-directional Manipulation Learning
Tian-Yu Xiang
Ao-Qun Jin
Xiao-Hu Zhou
Mei-Jiang Gui
Xiao-Liang Xie
...
Shuang-Yi Wang
Sheng-Bin Duang
Si-Cheng Wang
Zheng Lei
Z. Hou
66
1
0
06 Mar 2025
ViT-VS: On the Applicability of Pretrained Vision Transformer Features for Generalizable Visual Servoing
Alessandro Scherl
Stefan Thalhammer
Bernhard Neuberger
Wilfried Wöber
José Gracía-Rodríguez
71
0
0
06 Mar 2025
DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation
Amin Karimi
Charalambos Poullis
VLM
51
0
0
06 Mar 2025
Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation
Aishik Konwer
Zhijian Yang
Erhan Bas
Cao Xiao
Prateek Prasanna
Parminder Bhatia
Taha A. Kass-Hout
MedIm
VLM
73
0
0
06 Mar 2025
IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement
Zhihao Shi
Dong Huo
Yuhongze Zhou
Kejia Yin
Yan Min
Juwei Lu
Wei Ji
69
1
0
06 Mar 2025
From Architectural Sketch to Conceptual Representation: Using Structure-Aware Diffusion Model to Generate Renderings of School Buildings
Zhengyang Wang
H. Jin
Xusheng Du
Yuxiao Ren
Ye Zhang
H. Xie
DiffM
52
0
0
05 Mar 2025
Task-Agnostic Attacks Against Vision Foundation Models
Brian Pulfer
Yury Belousov
Vitaliy Kinakh
Teddy Furon
S. Voloshynovskiy
AAML
77
0
0
05 Mar 2025
Interactive Segmentation and Report Generation for CT Images
Yannian Gu
Wenhui Lei
Hanyu Chen
Xiaofan Zhang
S. Zhang
60
0
0
05 Mar 2025
GenColor: Generative Color-Concept Association in Visual Design
Yihan Hou
Xingchen Zeng
Yusong Wang
Manling Yang
Xiaojiao Chen
Wei Zeng
DiffM
78
0
0
05 Mar 2025
AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model
Wenlun Zhang
Shimpei Ando
Kentaro Yoshioka
VLM
MQ
67
0
0
05 Mar 2025
Is Pre-training Applicable to the Decoder for Dense Prediction?
Chao Ning
Wanshui Gan
Weihao Xuan
Naoto Yokoya
48
0
0
05 Mar 2025
From Infants to AI: Incorporating Infant-like Learning in Models Boosts Efficiency and Generalization in Learning Social Prediction Tasks
Shify Treger
Shimon Ullman
61
0
0
05 Mar 2025
AirExo-2: Scaling up Generalizable Robotic Imitation Learning with Low-Cost Exoskeletons
Hongjie Fang
Chenxi Wang
Yiming Wang
J. Chen
Shangning Xia
...
Xinyu Zhan
Lixin Yang
Weiming Wang
Cewu Lu
Hao-Shu Fang
87
1
0
05 Mar 2025
TopoMortar: A dataset to evaluate image segmentation methods focused on topology accuracy
Juan Miguel Valverde
Motoya Koga
Nijihiko Otsuka
Anders Bjorholm Dahl
38
0
0
05 Mar 2025
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Suhwan Cho
Seunghoon Lee
Minhyeok Lee
Jungho Lee
Sangyoun Lee
VOS
77
0
0
05 Mar 2025
Generative Artificial Intelligence in Robotic Manipulation: A Survey
Kun Zhang
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
...
Qifeng Chen
Jia Pan
Wei Zhang
Bo Yang
Hua Chen
59
1
0
05 Mar 2025
SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning
Borong Zhang
Yuhao Zhang
Yalan Qin
Yingshan Lei
Josef Dai
Yuanpei Chen
Yaodong Yang
66
4
0
05 Mar 2025
SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection
Devanish N. Kamtam
Joseph B. Shrager
Satya Deepya Malla
Xiaohan Wang
Nicole Lin
Juan J. Cardona
Serena Yeung-Levy
Clarence Hu
VLM
55
0
0
05 Mar 2025
Building 3D In-Context Learning Universal Model in Neuroimaging
Jiesi Hu
Hanyang Peng
Yanwu Yang
Xutao Guo
Yang Shang
P. Shi
Chenfei Ye
Ting Ma
67
0
0
04 Mar 2025
Four Principles for Physically Interpretable World Models
Jordan Peper
Zhenjiang Mao
Yuang Geng
Siyuan Pan
Ivan Ruchkin
110
1
0
04 Mar 2025
Label-Efficient LiDAR Panoptic Segmentation
Ahmet Selim Çanakçı
Niclas Vodisch
Kürsat Petek
Wolfram Burgard
Abhinav Valada
3DPC
86
0
0
04 Mar 2025
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
Jiayi Zhao
Fei Teng
Kai Luo
Guoqiang Zhao
Zehan Li
Xu Zheng
Kailun Yang
VLM
79
6
0
04 Mar 2025
Previous
1
2
3
...
12
13
14
...
82
83
84
Next