Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation

6 June 2022
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
    ISeg

Papers citing "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

50 / 230 papers shown
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
Weiyun Wang
Min Shi
Qingyun Li
Wen Wang
Zhenhang Huang
...
Zhiguo Cao
Yushi Chen
Tong Lu
Jifeng Dai
Yu Qiao
LRM
MLLM
33
83
0
03 Aug 2023
Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport
Wentong Li
Yu-Jie Yuan
Song Wang
Jianke Zhu
Jianshu Li
Jian Liu
Lei Zhang
3DPC
OT
25
19
0
03 Aug 2023
Revisiting DETR Pre-training for Object Detection
Yan Ma
Weicong Liang
Bo-Ying Chen
Yiduo Hao
Bojian Hou
Xiangyu Yue
Chao Zhang
Yuhui Yuan
VLM
ViT
25
4
0
02 Aug 2023
GEM: Boost Simple Network for Glass Surface Segmentation via Vision Foundation Models
Jing Hao
Xinyu Li
Liang Gao
Shumin Han
VLM
DiffM
16
2
0
22 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
19
32
0
18 Jul 2023
OG: Equip vision occupancy with instance segmentation and visual grounding
Zichao Dong
Hang Ji
Weikun Zhang
Xufeng Huang
Junbo Chen
ISeg
VLM
18
0
0
12 Jul 2023
Semantic-SAM: Segment and Recognize Anything at Any Granularity
Feng Li
Hao Zhang
Pei Sun
Xueyan Zou
Siyi Liu
Jianwei Yang
Chun-yue Li
Lei Zhang
Jianfeng Gao
VLM
16
171
0
10 Jul 2023
EffSeg: Efficient Fine-Grained Instance Segmentation using Structure-Preserving Sparsity
Cédric Picron
Tinne Tuytelaars
ISeg
15
0
0
04 Jul 2023
AVSegFormer: Audio-Visual Segmentation with Transformer
Sheng Gao
Zhe Chen
Guo Chen
Wenhai Wang
Tong Lu
VOS
21
45
0
03 Jul 2023
Hierarchical Open-vocabulary Universal Image Segmentation
Xudong Wang
Shufang Li
Konstantinos Kallidromitis
Yu Kato
Kazuki Kozuka
Trevor Darrell
VLM
OCL
30
36
0
03 Jul 2023
ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation
Shuyang Sun
Weijun Wang
Qihang Yu
Andrew G. Howard
Philip H. S. Torr
Liang-Chieh Chen
24
15
0
29 Jun 2023
Symphonize 3D Semantic Scene Completion with Contextual Instance Queries
Hao Jiang
Tianheng Cheng
Naiyu Gao
Haoyang Zhang
Tianwei Lin
Wenyu Liu
Xinggang Wang
18
55
0
27 Jun 2023
MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction
Limeng Qiao
Yongchao Zheng
Peng Zhang
Wenjie Ding
Xi Qiu
Xing Wei
Chi Zhang
3DGS
3DPC
3DV
24
15
0
17 Jun 2023
Robustness Analysis on Foundational Segmentation Models
Madeline Chantry Schiappa
Shehreen Azad
V. Sachidanand
Yunhao Ge
O. Mikšík
Y. S. Rawat
Vibhav Vineet
OOD
VLM
AAML
14
5
0
15 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
29
6
0
14 Jun 2023
VISION Datasets: A Benchmark for Vision-based InduStrial InspectiON
Haoping Bai
Shancong Mou
Tatiana Likhomanenko
R. G. Cinbis
Oncel Tuzel
Ping-Chia Huang
Jiulong Shan
Jianjun Shi
Mengsi Cao
VLM
8
23
0
13 Jun 2023
detrex: Benchmarking Detection Transformers
Tianhe Ren
Siyi Liu
Feng Li
Hao Zhang
Ailing Zeng
...
Zhaoyang Zeng
Xianbiao Qi
Yuhui Yuan
Jianwei Yang
Lei Zhang
20
13
0
12 Jun 2023
SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers
Bowen Zhang
Liyang Liu
Minh Hieu Phan
Zhi Tian
Chunhua Shen
Yifan Liu
ViT
19
28
0
09 Jun 2023
RefineVIS: Video Instance Segmentation with Temporal Attention Refinement
Andre Abrantes
Jiang Wang
Peng Chu
Quanzeng You
Zicheng Liu
VOS
8
0
0
07 Jun 2023
Industrial Anomaly Detection and Localization Using Weakly-Supervised Residual Transformers
Hanxi Li
Jing Wu
Lin Yuanbo Wu
Hao Chen
Deyin Liu
Mingwen Wang
Peng Wang
ViT
32
4
0
06 Jun 2023
Interactive Segment Anything NeRF with Feature Imitation
Xiaokang Chen
Jiaxiang Tang
Diwen Wan
Jingbo Wang
Gang Zeng
29
22
0
25 May 2023
ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents
Christoph Auer
A. Nassar
Maksym Lysak
Michele Dolfi
Nikolaos Livathinos
Peter W. J. Staar
OOD
3DV
22
6
0
24 May 2023
Instance-Aware Repeat Factor Sampling for Long-Tailed Object Detection
Burhaneddin Yaman
Tanvir Mahmud
Chun-Hao Liu
13
4
0
14 May 2023
Self-Supervised Instance Segmentation by Grasping
YuXuan Liu
Xi Chen
Pieter Abbeel
17
6
0
10 May 2023
SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
Ayan Banerjee
Sanket Biswas
Josep Lladós
Umapada Pal
ViT
4
16
0
08 May 2023
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
37
132
0
19 Apr 2023
Boosting Semantic Segmentation with Semantic Boundaries
Haruya Ishikawa
Y. Aoki
18
4
0
19 Apr 2023
Align-DETR: Improving DETR with Simple IoU-aware BCE loss
Zhi Cai
Songtao Liu
Guodong Wang
Zheng Ge
Xiangyu Zhang
Di Huang
21
2
0
15 Apr 2023
Segment Everything Everywhere All at Once
Xueyan Zou
Jianwei Yang
Hao Zhang
Feng Li
Linjie Li
Jianfeng Wang
Lijuan Wang
Jianfeng Gao
Yong Jae Lee
MLLM
VLM
9
453
0
13 Apr 2023
Detection Transformer with Stable Matching
Siyi Liu
Tianhe Ren
Jia-Yu Chen
Zhaoyang Zeng
Hao Zhang
...
Hongyang Li
Jun Huang
Hang Su
Jun Zhu
Lei Zhang
20
33
0
10 Apr 2023
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network
Cong Han
Yujie Zhong
Dengjie Li
Kai Han
Lin Ma
VLM
SSeg
6
30
0
03 Apr 2023
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos
Minghan Li
Shuai Li
Wangmeng Xiang
Lei Zhang
26
9
0
25 Mar 2023
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR Perception
Zixiang Zhou
Dongqiangzi Ye
Weijia Chen
Yufei Xie
Yu Wang
Panqu Wang
H. Foroosh
24
10
0
21 Mar 2023
TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization
Ziquan Liu
Yi Tian Xu
Xiangyang Ji
Antoni B. Chan
AAML
19
17
0
20 Mar 2023
FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation
Junjie He
Pengyu Li
Yifeng Geng
Xuansong Xie
ISeg
VLM
19
43
0
15 Mar 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
Hao Zhang
Feng Li
Xueyan Zou
Siyi Liu
Chun-yue Li
Jianfeng Gao
Jianwei Yang
Lei Zhang
ObjD
VLM
4
149
0
14 Mar 2023
MP-Former: Mask-Piloted Transformer for Image Segmentation
Hao Zhang
Feng Li
Hu-Sheng Xu
Shijia Huang
Siyi Liu
L. Ni
Lei Zhang
ViT
MedIm
11
58
0
13 Mar 2023
Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR
Feng Li
Ailing Zeng
Siyi Liu
Hao Zhang
Hongyang Li
Lei Zhang
L. Ni
ViT
29
61
0
13 Mar 2023
Object-Centric Multi-Task Learning for Human Instances
Hyeongseok Son
Sang-Il Jung
Solae Lee
Seong-heum Kim
Seungsang Park
ByungIn Yoo
3DH
17
0
0
13 Mar 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
14
1,797
0
09 Mar 2023
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
Jiarui Xu
Sifei Liu
Arash Vahdat
Wonmin Byeon
Xiaolong Wang
Shalini De Mello
VLM
209
318
0
08 Mar 2023
UniHCP: A Unified Model for Human-Centric Perceptions
Yuanzheng Ci
Yizhou Wang
Meilin Chen
Shixiang Tang
Lei Bai
Feng Zhu
Rui Zhao
F. Yu
Donglian Qi
Wanli Ouyang
77
50
0
06 Mar 2023
Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves
Sora Takashima
Ryo Hayamizu
Nakamasa Inoue
Hirokatsu Kataoka
Rio Yokota
60
18
0
02 Mar 2023
ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution
T.D. Ngo
Binh-Son Hua
Khoi Duc Minh Nguyen
3DPC
ISeg
13
42
0
01 Mar 2023
PUPS: Point Cloud Unified Panoptic Segmentation
Shihao Su
Jianyun Xu
Huanyu Wang
Zhenwei Miao
Xin Zhan
Dayang Hao
Xi Li
3DPC
11
19
0
13 Feb 2023
Cross-Modal Fine-Tuning: Align then Refine
Junhong Shen
Liam Li
Lucio Dery
Corey Staten
M. Khodak
Graham Neubig
Ameet Talwalkar
26
33
0
11 Feb 2023
SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Yash J. Patel
Yusheng Xie
Yi Zhu
Srikar Appalaraju
R. Manmatha
19
4
0
07 Feb 2023
Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation
Jie-jin Yang
Ailing Zeng
Siyi Liu
Feng Li
Ruimao Zhang
Lei Zhang
14
50
0
03 Feb 2023
Aerial Image Object Detection With Vision Transformer Detector (ViTDet)
Liya Wang
A. Tien
30
6
0
28 Jan 2023
Head-Free Lightweight Semantic Segmentation with Linear Transformer
B. Dong
Pichao Wang
Fan Wang
ViT
8
64
0
11 Jan 2023