2206.02777
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
6 June 2022
Feng Li
Hao Zhang
Huaizhe Xu
Shilong Liu
Lei Zhang
L. Ni
H. Shum
ISeg
Papers citing
"Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
30 / 230 papers shown
Title
Vision Transformers Are Good Mask Auto-Labelers
Shiyi Lan
Xitong Yang
Zhiding Yu
Zuxuan Wu
J. Álvarez
Anima Anandkumar
ISeg
ViT
MedIm
13
19
0
10 Jan 2023
On Advantages of Mask-level Recognition for Outlier-aware Segmentation
Matej Grcić
Josip Šarić
Siniša Šegvić
UQCV
79
34
0
09 Jan 2023
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
27
16
0
03 Jan 2023
Interactive Segmentation of Radiance Fields
Rahul Goel
Dhawal Sirikonda
Saurabh Saini
P. J. Narayanan
21
49
0
27 Dec 2022
Planning-oriented Autonomous Driving
Yi Hu
Jiazhi Yang
Li Chen
Keyu Li
Chonghao Sima
...
Xiaosong Jia
Qiang Liu
Jifeng Dai
Yu Qiao
Hongyang Li
33
585
0
20 Dec 2022
Panoptic Lifting for 3D Scene Understanding with Neural Fields
Yawar Siddiqui
Lorenzo Porzi
Samuel Rota Buló
Norman Muller
Matthias Nießner
Angela Dai
Peter Kontschieder
17
128
0
19 Dec 2022
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
Jishnu Mukhoti
Tsung-Yu Lin
Omid Poursaeed
Rui Wang
Ashish Shah
Philip H. S. Torr
Ser-Nam Lim
VLM
27
79
0
09 Dec 2022
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion
Hanqing Zhao
Dianmo Sheng
Jianmin Bao
Dongdong Chen
Dong Chen
...
Ce Liu
Wenbo Zhou
Qi Chu
Weiming Zhang
Neng H. Yu
VLM
DiffM
28
39
0
07 Dec 2022
DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu
Haoxing Chen
Zhuoer Xu
Jun Lan
Changhua Meng
Weiqiang Wang
DiffM
14
65
0
06 Dec 2022
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
Wentong Li
Wenyu Liu
Jianke Zhu
Miaomiao Cui
Risheng Yu
Xia Hua
Lei Zhang
ISeg
19
30
0
03 Dec 2022
CPPF++: Uncertainty-Aware Sim2Real Object Pose Estimation by Vote Aggregation
Yang You
Wenhao He
Jin Liu
Hongkai Xiong
Weiming Wang
Cewu Lu
3DPC
25
3
0
24 Nov 2022
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
Jiaru Jia
Ming-Yu Liu
Jiake Xie
Xin Chen
Hong Zhang
Xin Jiang
Aiqing Yang
14
0
0
21 Nov 2022
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Hao Li
Jinguo Zhu
Xiaohu Jiang
Xizhou Zhu
Hongsheng Li
...
Xiaohua Wang
Yu Qiao
Xiaogang Wang
Wenhai Wang
Jifeng Dai
MLLM
13
55
0
17 Nov 2022
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information
Weijie Su
Xizhou Zhu
Chenxin Tao
Lewei Lu
Bin Li
Gao Huang
Yu Qiao
Xiaogang Wang
Jie Zhou
Jifeng Dai
26
41
0
17 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
49
671
0
14 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
12
317
0
10 Nov 2022
Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding
Dominik Filipiak
Andrzej Zapala
Piotr Tempczyk
A. Fensel
Marek Cygan
ISeg
13
9
0
07 Nov 2022
Evaluating Novel Mask-RCNN Architectures for Ear Mask Segmentation
Saurav K. Aryal
Teanna Barrett
Gloria J. Washington
11
2
0
05 Nov 2022
A Generalist Framework for Panoptic Segmentation of Images and Videos
Ting-Li Chen
Lala Li
Saurabh Saxena
Geoffrey E. Hinton
David J. Fleet
VGen
MLLM
17
102
0
12 Oct 2022
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Hongyang Li
Chonghao Sima
Jifeng Dai
Wenhai Wang
Lewei Lu
...
Xiaosong Jia
Siqian Liu
Jianping Shi
Dahua Lin
Yu Qiao
88
138
0
12 Sep 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Wenhui Wang
Hangbo Bao
Li Dong
Johan Bjorck
Zhiliang Peng
...
Kriti Aggarwal
O. Mohammed
Saksham Singhal
Subhojit Som
Furu Wei
MLLM
VLM
ViT
11
625
0
22 Aug 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
28
116
0
26 Jul 2022
DETRs with Hybrid Matching
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
19
179
0
26 Jul 2022
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation
Yixuan Wei
Han Hu
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Jianmin Bao
Dong Chen
B. Guo
CLIP
80
123
0
27 May 2022
Vision Transformer Adapter for Dense Predictions
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
25
537
0
17 May 2022
Dynamic Focus-aware Positional Queries for Semantic Segmentation
Haoyu He
Jianfei Cai
Zizheng Pan
Jing Liu
Jing Zhang
Dacheng Tao
Bohan Zhuang
29
16
0
04 Apr 2022
Panoptic SwiftNet: Pyramidal Fusion for Real-time Panoptic Segmentation
Josip Saric
Marin Orsic
Sinisa Segvic
19
8
0
15 Mar 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
Feng Li
Hao Zhang
Shi-guang Liu
Jian Guo
L. Ni
Lei Zhang
ViT
25
636
0
02 Mar 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
Shilong Liu
Feng Li
Hao Zhang
X. Yang
Xianbiao Qi
Hang Su
Jun Zhu
Lei Zhang
ViT
138
703
0
28 Jan 2022
Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
Golnaz Ghiasi
Yin Cui
A. Srinivas
Rui Qian
Tsung-Yi Lin
E. D. Cubuk
Quoc V. Le
Barret Zoph
ISeg
223
962
0
13 Dec 2020