Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.02643
Cited By
Segment Anything
5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Segment Anything"
50 / 4,188 papers shown
Title
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset
Jiakang Yuan
Bo-Wen Zhang
Xiangchao Yan
Tao Chen
Botian Shi
Yikang Li
Yu Qiao
3DPC
18
25
0
01 Jun 2023
DeSAM: Decoupled Segment Anything Model for Generalizable Medical Image Segmentation
Yifan Gao
W. Xia
Dingdu Hu
Wenkui Wang
Xin Gao
OOD
VLM
MedIm
18
29
0
01 Jun 2023
SAM-helps-Shadow:When Segment Anything Model meet shadow removal
Xiaofeng Zhang
Chaochen Gu
Shanying Zhu
VLM
39
11
0
01 Jun 2023
Sea Ice Extraction via Remote Sensed Imagery: Algorithms, Datasets, Applications and Challenges
Anzhu Yu
Wenjun Huang
Qing Xu
Qun Sun
Wenyue Guo
Song Ji
Bowei Wen
C. Qiu
27
3
0
01 Jun 2023
Using Visual Cropping to Enhance Fine-Detail Question Answering of BLIP-Family Models
Jiarui Zhang
Mahyar Khayatkhoei
P. Chhikara
Filip Ilievski
27
1
0
31 May 2023
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias
Zhongwei Wan
Che Liu
Mi Zhang
Jie Fu
Benyou Wang
Sibo Cheng
Lei Ma
César Quilodrán-Casas
Rossella Arcucci
50
71
0
31 May 2023
A Survey of Label-Efficient Deep Learning for 3D Point Clouds
Aoran Xiao
Xiaoqin Zhang
Ling Shao
Shijian Lu
3DPC
41
18
0
31 May 2023
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models
Sivan Doveh
Assaf Arbelle
Sivan Harary
Roei Herzig
Donghyun Kim
...
Yikang Shen
Raja Giryes
Rogerio Feris
S. Ullman
Leonid Karlinsky
VLM
CoGe
47
52
0
31 May 2023
PaintSeg: Training-free Segmentation via Painting
Xiang Li
Chung-Ching Lin
Yinpeng Chen
Zicheng Liu
Jinglu Wang
Bhiksha Raj
40
5
0
30 May 2023
AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Chuhao Jin
Wenhui Tan
Jiange Yang
Bei Liu
Ruihua Song
Limin Wang
Jianlong Fu
LM&Ro
LRM
27
24
0
30 May 2023
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
Rui Yang
Lin Song
Yanwei Li
Sijie Zhao
Yixiao Ge
Xiu Li
Ying Shan
SyDa
MLLM
33
209
0
30 May 2023
LayerDiffusion: Layered Controlled Image Editing with Diffusion Models
Pengzhi Li
Qinxuan Huang
Yikang Ding
Zhiheng Li
DiffM
27
35
0
30 May 2023
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
32
78
0
29 May 2023
Pix2Repair: Implicit Shape Restoration from Images
Xinchao Song
N. Lamb
Sean Banerjee
N. Banerjee
3DV
29
0
0
29 May 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
50
89
0
29 May 2023
Explicit Visual Prompting for Universal Foreground Segmentations
Weihuang Liu
Xi Shen
Chi-Man Pun
Xiaodong Cun
VPVLM
VLM
38
14
0
29 May 2023
GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions
Tao Wang
Kaihao Zhang
Ziqian Shao
Wenhan Luo
B. Stenger
Tong Lu
Tae-Kyun Kim
Wei Liu
Hongdong Li
ViT
32
30
0
29 May 2023
AIMS: All-Inclusive Multi-Level Segmentation
Lu Qi
Jason Kuen
Weidong Guo
Jiuxiang Gu
Zhe-nan Lin
Bo Du
Yu-Syuan Xu
Ming-Hsuan Yang
VLM
21
6
0
28 May 2023
Text-to-image Editing by Image Information Removal
Zhongping Zhang
Jian Zheng
Jacob Zhiyuan Fang
Bryan A. Plummer
DiffM
29
12
0
27 May 2023
VoxDet: Voxel Learning for Novel Instance Detection
Bowen Li
Jiashun Wang
Yaoyu Hu
Chen Wang
Sebastian Scherer
38
6
0
26 May 2023
Building One-class Detector for Anything: Open-vocabulary Zero-shot OOD Detection Using Text-image Models
Yunhao Ge
Jie Jessie Ren
Jiaping Zhao
Kaifeng Chen
Andrew Gallagher
Laurent Itti
Balaji Lakshminarayanan
VLM
ObjD
26
1
0
26 May 2023
Pulse shape discrimination based on the Tempotron: a powerful classifier on GPU
Haoran Liu
Peng Li
Ming-Yu Liu
Kai-Ming Wang
Zhuo Zuo
Bingqi Liu
33
1
0
26 May 2023
OpenVIS: Open-vocabulary Video Instance Segmentation
Pinxue Guo
Tony Huang
Peiyang He
Xuefeng Liu
Tianjun Xiao
Zhaoyu Chen
Wenqiang Zhang
VLM
36
16
0
26 May 2023
Detect Any Shadow: Segment Anything for Video Shadow Detection
Yonghui Wang
Wen-gang Zhou
Yunyao Mao
Houqiang Li
VLM
21
22
0
26 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
22
30
0
25 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLM
DiffM
30
165
0
25 May 2023
Imitating Task and Motion Planning with Visuomotor Transformers
Murtaza Dalal
Ajay Mandlekar
Caelan Reed Garrett
Ankur Handa
Ruslan Salakhutdinov
Dieter Fox
55
54
0
25 May 2023
Interactive Segment Anything NeRF with Feature Imitation
Xiaokang Chen
Jiaxiang Tang
Diwen Wan
Jingbo Wang
Gang Zeng
54
22
0
25 May 2023
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
Xingqian Xu
Jiayi Guo
Zhangyang Wang
Gao Huang
Irfan Essa
Humphrey Shi
VLM
DiffM
38
57
0
25 May 2023
On the Robustness of Segment Anything
Yihao Huang
Yue Cao
Tianlin Li
Felix Juefei Xu
Di Lin
Ivor W.Tsang
Yang Liu
Qing Guo
AAML
VLM
27
27
0
25 May 2023
Sim-Suction: Learning a Suction Grasp Policy for Cluttered Environments Using a Synthetic Benchmark
Juncheng Li
D. Cappelleri
3DPC
22
11
0
25 May 2023
ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst
Zijia Zhao
Longteng Guo
Tongtian Yue
Si-Qing Chen
Shuai Shao
Xinxin Zhu
Zehuan Yuan
Jing Liu
MLLM
40
52
0
25 May 2023
ChatCAD+: Towards a Universal and Reliable Interactive CAD using LLMs
Zihao Zhao
Sheng Wang
Jinchen Gu
Yitao Zhu
Lanzhuju Mei
Zixu Zhuang
Zhiming Cui
Qian Wang
Dinggang Shen
LM&MA
34
36
0
25 May 2023
Towards Language-guided Interactive 3D Generation: LLMs as Layout Interpreter with Generative Feedback
Yiqi Lin
Hao Wu
Ruichen Wang
H. Lu
Xiaodong Lin
Hui Xiong
Lin Wang
3DV
45
12
0
25 May 2023
POPE: 6-DoF Promptable Pose Estimation of Any Object, in Any Scene, with One Reference
Zhiwen Fan
Pan Pan
Peihao Wang
Yi Ding
Dejia Xu
Hanwen Jiang
Zhangyang Wang
37
24
0
25 May 2023
L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors
Zheng Chang
Shuchen Weng
Pei Zhang
Yu Li
Si Li
Boxin Shi
DiffM
21
7
0
24 May 2023
InNeRF360: Text-Guided 3D-Consistent Object Inpainting on 360-degree Neural Radiance Fields
Dongqing Wang
Tong Zhang
Alaa Abboud
Sabine Süsstrunk
40
12
0
24 May 2023
PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of Pathology
Yuxuan Sun
Chenglu Zhu
S. Zheng
Kai Zhang
Xiaoxuan Yu
Zhongyi Shui
Yunlong Zhang
Honglin Li
Lin Yang
LM&MA
MedIm
8
42
0
24 May 2023
DC-Net: Divide-and-Conquer for Salient Object Detection
Jiayi Zhu
Xuebin Qin
Abdulmotaleb Elsaddik
32
11
0
24 May 2023
Polarimetric Imaging for Perception
Michael Baltaxe
Tomer Peér
Dan Levi
25
2
0
24 May 2023
SAD: Segment Any RGBD
Jun Cen
Yizhe Wu
Kewei Wang
Xingyi Li
Jingkang Yang
Yixuan Pei
Lingdong Kong
Ziwei Liu
Qifeng Chen
34
14
0
23 May 2023
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang
Zekang Chen
Chen Chen
Jiancang Ma
H. Lu
Xiaodong Lin
DiffM
52
65
0
23 May 2023
VisorGPT: Learning Visual Prior via Generative Pre-Training
Jinheng Xie
Kai Ye
Yudong Li
Yuexiang Li
Kevin Qinghong Lin
Yefeng Zheng
Linlin Shen
Mike Zheng Shou
ViT
119
8
0
23 May 2023
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Long Lian
Boyi Li
Adam Yala
Trevor Darrell
43
152
0
23 May 2023
A Dive into SAM Prior in Image Restoration
Zeyu Xiao
Jiawang Bai
Zhihe Lu
Zhiwei Xiong
29
16
0
23 May 2023
VDD: Varied Drone Dataset for Semantic Segmentation
Wenxiao Cai
Ke Jin
Jinyan Hou
Cong Guo
Letian Wu
Wankou Yang
48
11
0
23 May 2023
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
Yang Liu
Muzhi Zhu
Hengtao Li
Hao Chen
Xinlong Wang
Chunhua Shen
VLM
MLLM
88
84
0
22 May 2023
Restore Anything Pipeline: Segment Anything Meets Image Restoration
Jiaxi Jiang
Christian Holz
VLM
29
8
0
22 May 2023
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration
Qifan Yu
Juncheng Li
Wentao Ye
Siliang Tang
Yueting Zhuang
36
13
0
22 May 2023
UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model
Zhenghao Zhang
Shengfan Zhang
Zhichao Wei
Zuozhuo Dai
Siyu Zhu
VOS
VLM
25
16
0
22 May 2023
Previous
1
2
3
...
79
80
81
82
83
84
Next