2306.01567
Cited By
Segment Anything in High Quality
2 June 2023
Lei Ke
Mingqiao Ye
Martin Danelljan
Yifan Liu
Yu-Wing Tai
Chi-Keung Tang
Fisher Yu
VLM
Papers citing
"Segment Anything in High Quality"
50 / 206 papers shown
Title
PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Henghui Ding
Chang Liu
Yunchao Wei
Nikhila Ravi
Shuting He
...
Bo-Lu Zhao
Jing Liu
Feiyu Pan
Hao Fang
Xiankai Lu
48
8
0
24 Jun 2024
2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
Bin Cao
Yisi Zhang
Xuanxu Lin
Xingjian He
Bo-Lu Zhao
Jing Liu
50
2
0
20 Jun 2024
Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies
Haojie Huang
Karl Schmeckpeper
Dian Wang
Ondrej Biza
Yaoyao Qian
Haotian Liu
Mingxi Jia
Robert Platt
Robin Walters
VGen
LM&Ro
29
6
0
17 Jun 2024
RobustSAM: Segment Anything Robustly on Degraded Images
Wei-Ting Chen
Yu-Jiet Vong
Sy-Yen Kuo
Sizhuo Ma
Jian Wang
VLM
46
9
0
13 Jun 2024
Training-Free Robust Interactive Video Object Segmentation
Xiaoli Wei
Zhaoqing Wang
Yandong Guo
Chunxia Zhang
Tongliang Liu
Mingming Gong
VLM
VOS
30
1
0
08 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
29
22
0
06 Jun 2024
Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling
Xinhang Liu
Yu-Wing Tai
Chi-Keung Tang
Pedro Miraldo
Suhas Lohit
Moitreya Chatterjee
3DGS
25
4
0
06 Jun 2024
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
An-Chieh Cheng
Hongxu Yin
Yang Fu
Qiushan Guo
Ruihan Yang
Jan Kautz
Xiaolong Wang
Sifei Liu
LRM
48
43
0
03 Jun 2024
Hyper-Transformer for Amodal Completion
Jianxiong Gao
Xuelin Qian
Longfei Liang
Junwei Han
Yanwei Fu
ViT
36
1
0
30 May 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
48
4
0
28 May 2024
How Much You Ate? Food Portion Estimation on Spoons
Aaryam Sharma
Chris Czarnecki
Yuhao Chen
Pengcheng Xi
Linlin Xu
Alexander Wong
13
1
0
12 May 2024
Structured Click Control in Transformer-based Interactive Segmentation
Long Xu
Yong-Xiang Chen
Rui Huang
Feng Wu
Shiwu Lai
19
0
0
07 May 2024
PTQ4SAM: Post-Training Quantization for Segment Anything
Chengtao Lv
Hong Chen
Jinyang Guo
Yifu Ding
Xianglong Liu
VLM
MQ
26
13
0
06 May 2024
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
Wen-Hsuan Chu
Lei Ke
Katerina Fragkiadaki
3DGS
VGen
21
29
0
03 May 2024
MoPEFT: A Mixture-of-PEFTs for the Segment Anything Model
Rajat Sahay
Andreas E. Savakis
MoE
36
0
0
01 May 2024
ASAM: Boosting Segment Anything Model with Adversarial Tuning
Bo Li
Haoke Xiao
Lv Tang
22
9
0
01 May 2024
PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
Zhangjing Yang
Dun Liu
Wensheng Cheng
Jinqiao Wang
Yi Wu
VLM
29
2
0
22 Apr 2024
Interpreting COVID Lateral Flow Tests' Results with Foundation Models
Stuti Pandey
Josh Myers-Dean
Jarek Reynolds
Danna Gurari
34
0
0
21 Apr 2024
Learning from Unlabelled Data with Transformers: Domain Adaptation for Semantic Segmentation of High Resolution Aerial Images
Nikolaos Dionelis
Francesco Pro
Luca Maiano
Irene Amerini
B. L. Saux
34
3
0
17 Apr 2024
VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection
Bonan Ding
Jin Xie
Jing Nie
Jiale Cao
24
2
0
15 Apr 2024
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
Junchi Wang
Lei Ke
MLLM
LRM
VLM
36
18
0
12 Apr 2024
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
40
16
0
12 Apr 2024
Adapting the Segment Anything Model During Usage in Novel Situations
Robin Schon
Julian Lorenz
K. Ludwig
Rainer Lienhart
VLM
29
7
0
12 Apr 2024
Practical Region-level Attack against Segment Anything Models
Yifan Shen
Zhengyuan Li
Gang Wang
VLM
33
9
0
12 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao-Yu Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
70
2
0
06 Apr 2024
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Fei Chen
Steven G. McDonagh
Gerasimos Lampouras
Ignacio Iacobacci
Sarah Parisot
37
10
0
03 Apr 2024
Unsegment Anything by Simulating Deformation
Jiahao Lu
Xingyi Yang
Xinchao Wang
29
4
0
03 Apr 2024
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts
Qin Liu
Jaemin Cho
Mohit Bansal
Marc Niethammer
VLM
25
10
0
31 Mar 2024
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias
Sang-Kee Jo
Soohyun Ryu
Sungyub Kim
Eunho Yang
Kyungsu Kim
35
1
0
30 Mar 2024
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation
Sang-Kee Jo
Fei Pan
In-Jae Yu
Kyungsu Kim
25
2
0
30 Mar 2024
Efficient 3D Instance Mapping and Localization with Neural Fields
George Tang
Krishna Murthy Jatavallabhula
Antonio Torralba
ISeg
32
5
0
28 Mar 2024
SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects
Avinash Ummadisingu
Jongkeum Choi
Koki Yamane
Shimpei Masuda
Naoki Fukaya
Kuniyuki Takahashi
55
2
0
28 Mar 2024
Annolid: Annotate, Segment, and Track Anything You Need
Chen Yang
Thomas A. Cleland
VOS
21
2
0
27 Mar 2024
A Roadmap Towards Automated and Regulated Robotic Systems
Yihao Liu
Mehran Armand
36
2
0
21 Mar 2024
Opti-Acoustic Semantic SLAM with Unknown Objects in Underwater Environments
Kurran Singh
Jungseok Hong
Nick Rypkema
John J. Leonard
39
1
0
19 Mar 2024
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Siyuan Huang
Iaroslav Ponomarenko
Zhengkai Jiang
Xiaoqi Li
Xiaobin Hu
Peng Gao
Hongsheng Li
Hao Dong
LM&Ro
32
16
0
17 Mar 2024
WSI-SAM: Multi-resolution Segment Anything Model (SAM) for histopathology whole-slide images
Hong Liu
Haosen Yang
P. Diest
J. Pluim
M. Veta
VLM
27
6
0
14 Mar 2024
Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything
Zijian Wu
Adam Schmidt
Peter Kazanzides
Septimiu E. Salcudean
35
2
0
12 Mar 2024
Reframe Anything: LLM Agent for Open World Video Reframing
Jiawang Cao
Yongliang Wu
Weiheng Chi
Wenbo Zhu
Ziyue Su
Jay Wu
31
3
0
10 Mar 2024
$\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations
Xiang Li
Kai Qiu
Jinglu Wang
Xiaohao Xu
Rita Singh
Kashu Yamazaki
Hao Chen
Xiaonan Huang
Bhiksha Raj
VOS
32
1
0
07 Mar 2024
VRP-SAM: SAM with Visual Reference Prompt
Yanpeng Sun
Jiahui Chen
Shan Zhang
Xinyu Zhang
Qiang Chen
Gang Zhang
Errui Ding
Jingdong Wang
Zechao Li
42
31
0
27 Feb 2024
RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation
Hanxiao Jiang
Binghao Huang
Ruihai Wu
Zhuoran Li
Shubham Garg
H. Nayyeri
Shenlong Wang
Yunzhu Li
29
17
0
23 Feb 2024
OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding
Guibiao Liao
Kaichen Zhou
Zhenyu Bao
Kanglin Liu
Qing Li
VLM
13
20
0
07 Feb 2024
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model
Aoran Xiao
Weihao Xuan
Heli Qi
Yun Xing
Ruijie Ren
Xiaoqin Zhang
Ling Shao
Shijian Lu
VLM
MLLM
35
10
0
06 Feb 2024
Region-Based Representations Revisited
Michal Shlapentokh-Rothman
Ansel Blume
Yao Xiao
Yuqun Wu
TV Sethuraman
Heyi Tao
Jae Yong Lee
Wilfredo Torres
Yu-xiong Wang
Derek Hoiem
32
5
0
04 Feb 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Maoyuan Ye
Jing Zhang
Juhua Liu
Chenyu Liu
Baocai Yin
Cong Liu
Bo Du
Dacheng Tao
VLM
35
10
0
31 Jan 2024
SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition
Xu Hu
Yuxi Wang
Lue Fan
Junsong Fan
Junran Peng
Zhen Lei
Qing Li
Zhaoxiang Zhang
3DGS
42
8
0
31 Jan 2024
Repositioning the Subject within Image
Yikai Wang
Chenjie Cao
Ke Fan
Qiaole Dong
Yifan Li
Xiangyang Xue
Yanwei Fu
DiffM
32
1
0
30 Jan 2024
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Tianhe Ren
Shilong Liu
Ailing Zeng
Jing Lin
Kunchang Li
...
Feng Li
Jie-jin Yang
Hongyang Li
Qing Jiang
Lei Zhang
VLM
35
373
0
25 Jan 2024
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation
Zhaozhi Xie
Bochen Guan
Weihao Jiang
Muyang Yi
Yue Ding
Hongtao Lu
Lei Zhang
VLM
31
13
0
23 Jan 2024