Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.05499
Cited By
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
9 March 2023
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
Jie-jin Yang
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
35 / 1,335 papers shown
Title
DetGPT: Detect What You Need via Reasoning
Renjie Pi
Jiahui Gao
Shizhe Diao
Rui Pan
Hanze Dong
...
Lewei Yao
Jianhua Han
Hang Xu
Lingpeng Kong Tong Zhang
Tong Zhang
LRM
LM&Ro
22
92
0
23 May 2023
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang
Zekang Chen
Chen Chen
Jiancang Ma
H. Lu
Xiaodong Lin
DiffM
39
65
0
23 May 2023
Cross3DVG: Cross-Dataset 3D Visual Grounding on Different RGB-D Scans
Taiki Miyanishi
Daich Azuma
Shuhei Kurita
M. Kawanabe
28
2
0
23 May 2023
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
Yang Liu
Muzhi Zhu
Hengtao Li
Hao Chen
Xinlong Wang
Chunhua Shen
VLM
MLLM
86
83
0
22 May 2023
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration
Qifan Yu
Juncheng Li
Wentao Ye
Siliang Tang
Yueting Zhuang
25
13
0
22 May 2023
Going Denser with Open-Vocabulary Part Segmentation
Pei Sun
Shoufa Chen
Chenchen Zhu
Fanyi Xiao
Ping Luo
Saining Xie
Zhicheng Yan
ObjD
VLM
20
45
0
18 May 2023
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
VLM
MLLM
ObjD
16
114
0
18 May 2023
Segment Any Anomaly without Training via Hybrid Prompt Regularization
Yunkang Cao
Xiaohao Xu
Chen Sun
Y. Cheng
Zongwei Du
Liang Gao
Weiming Shen
VLM
26
69
0
18 May 2023
OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields
Youtan Yin
Zhoujie Fu
Fan Yang
Guosheng Lin
35
29
0
17 May 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffM
VGen
23
32
0
15 May 2023
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models
Zhimin Chen
Longlong Jing
Yingwei Li
Bing Li
24
31
0
15 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
32
89
0
14 May 2023
A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering
Chaoning Zhang
Fachrina Dewi Puspitasari
Sheng Zheng
Chenghao Li
Yu Qiao
...
Caiyan Qin
François Rameau
Lik-Hang Lee
Sung-Ho Bae
Choong Seon Hong
VLM
76
62
0
12 May 2023
Segment and Track Anything
Yangming Cheng
Liulei Li
Yuanyou Xu
Xiaodi Li
Zongxin Yang
Wenguan Wang
Yi Yang
VOS
20
193
0
11 May 2023
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Zhaoyang Liu
Yinan He
Wenhai Wang
Weiyun Wang
Yi Wang
...
Yali Wang
Limin Wang
Ping Luo
Jifeng Dai
Yu Qiao
LRM
MLLM
12
79
0
09 May 2023
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model
Di Wang
Jing Zhang
Bo Du
Minqiang Xu
Lin Liu
Dacheng Tao
L. Zhang
114
69
0
03 May 2023
An Alternative to WSSS? An Empirical Study of the Segment Anything Model (SAM) on Weakly-Supervised Semantic Segmentation Problems
Weixuan Sun
Zheyuan Liu
Yanhao Zhang
Yiran Zhong
Nick Barnes
VLM
69
20
0
02 May 2023
Attack-SAM: Towards Attacking Segment Anything Model With Adversarial Examples
Chenshuang Zhang
Chaoning Zhang
Taegoo Kang
Donghun Kim
Sung-Ho Bae
In So Kweon
AAML
VLM
35
3
0
01 May 2023
Learnable Ophthalmology SAM
Zhongxi Qiu
Yan Hu
Heng Li
Jiang-Dong Liu
VLM
MedIm
24
24
0
26 Apr 2023
Segment Anything in 3D with Radiance Fields
Jiazhong Cen
Jiemin Fang
Zanwei Zhou
Chen Yang
Lingxi Xie
Xiaopeng Zhang
Wei-Ming Shen
Qi Tian
36
43
0
24 Apr 2023
Segment Anything in Non-Euclidean Domains: Challenges and Opportunities
Yongcheng Jing
Xinchao Wang
Dacheng Tao
25
21
0
23 Apr 2023
Can SAM Count Anything? An Empirical Study on SAM Counting
Zhiheng Ma
Xiaopeng Hong
Qinnan Shangguan
VLM
22
18
0
21 Apr 2023
Text2Seg: Remote Sensing Image Semantic Segmentation via Text-Guided Visual Foundation Models
Jielu Zhang
Zhongliang Zhou
Gengchen Mai
Lan Mu
Mengxuan Hu
Sheng R. Li
VLM
26
44
0
20 Apr 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
40
4,236
0
17 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
77
79
0
13 Apr 2023
SATR: Zero-Shot Semantic Segmentation of 3D Shapes
Ahmed Abdelreheem
Ivan Skorokhodov
M. Ovsjanikov
Peter Wonka
3DPC
25
38
0
11 Apr 2023
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images
Ruiqi Wang
A. Patil
Fenggen Yu
Hao Zhang
13
1
0
21 Mar 2023
Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
Hsuan-Kung Yang
Tsung-Chih Chiang
Tingxin Liu
Chun-Wei Huang
Jou-Min Liu
Tsu-Ching Hsiao
Chun-Yi Lee
21
1
0
05 Mar 2023
Explainable Anomaly Detection in Images and Videos: A Survey
Yizhou Wang
Dongliang Guo
Sheng R. Li
Octavia Camps
Yun Fu
16
5
0
13 Feb 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
Yue Wu
Yewen Fan
Paul Pu Liang
A. Azaria
Yuan-Fang Li
Tom Michael Mitchell
OffRL
11
47
0
09 Feb 2023
CPPF++: Uncertainty-Aware Sim2Real Object Pose Estimation by Vote Aggregation
Yang You
Wenhao He
Jin Liu
Hongkai Xiong
Weiming Wang
Cewu Lu
3DPC
25
3
0
24 Nov 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIP
VLM
115
151
0
20 Sep 2022
Complex Scene Image Editing by Scene Graph Comprehension
Zhongping Zhang
Huiwen He
Bryan A. Plummer
Z. Liao
Huayan Wang
DiffM
20
6
0
24 Mar 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
Shilong Liu
Feng Li
Hao Zhang
X. Yang
Xianbiao Qi
Hang Su
Jun Zhu
Lei Zhang
ViT
138
725
0
28 Jan 2022
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Tsung-Yi Lin
Weicheng Kuo
Yin Cui
VLM
ObjD
223
897
0
28 Apr 2021
Previous
1
2
3
...
25
26
27