Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation

6 June 2022
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
    ISeg

Papers citing "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

50 / 230 papers shown
Title
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem
M. E. Kalfaoglu
H. Öztürk
Ozsel Kilinc
A. Temizel
3DPC
27
2
0
17 Sep 2024
A Likelihood Ratio-Based Approach to Segmenting Unknown Objects
Nazir Nayal
Youssef Shoeb
Fatma Güney
OODD
25
4
0
10 Sep 2024
INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding
Ji Ha Jang
H. Seo
Se Young Chun
32
2
0
10 Sep 2024
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo
Hyesong Choi
Minhee Cho
Dongbo Min
34
1
0
04 Sep 2024
A Simple and Generalist Approach for Panoptic Segmentation
Nedyalko Prisadnikov
Wouter Van Gansbeke
Danda Pani Paudel
Luc Van Gool
VLM
38
0
0
29 Aug 2024
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
30
7
0
27 Aug 2024
VPOcc: Exploiting Vanishing Point for Monocular 3D Semantic Occupancy Prediction
Junsu Kim
Junhee Lee
Ukcheol Shin
Jean Oh
Kyungdon Joo
3DPC
29
0
0
07 Aug 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
31
3
0
31 Jul 2024
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue
Anurag Das
Francis Engelmann
Siyu Tang
J. E. Lenssen
41
23
0
29 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
31
1
0
23 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
58
1
0
23 Jul 2024
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
Sehwan Choi
Jungho Kim
Hongjae Shin
Jungwook Choi
3DPC
44
7
0
18 Jul 2024
IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild
Shuaixian Wang
Haoran Xu
Yaokun Li
Jiwei Chen
Guang Tan
16
2
0
15 Jul 2024
A Fair Ranking and New Model for Panoptic Scene Graph Generation
Julian Lorenz
Alexander Pest
Daniel Kienzle
K. Ludwig
Rainer Lienhart
41
1
0
12 Jul 2024
Anatomy-guided Pathology Segmentation
A. Jaus
C. Seibold
Simon Reiß
Lukas Heine
Anton Schily
Moon Kim
F. Bahnsen
Ken Herrmann
Rainer Stiefelhagen
Jens Kleesiek
MedIm
26
2
0
08 Jul 2024
Improving Computer Vision Interpretability: Transparent Two-level Classification for Complex Scenes
Stefan Scholz
Nils B. Weidmann
Zachary C. Steinert-Threlkeld
Eda Keremoğlu
Bastian Goldlücke
27
1
0
04 Jul 2024
Label-free Neural Semantic Image Synthesis
Jiayi Wang
Kevin Laube
Yumeng Li
J. H. Metzen
Shin-I Cheng
Julio Borges
Anna Khoreva
DiffM
25
0
0
01 Jul 2024
Rethinking Remote Sensing Change Detection With A Mask View
Xiaowen Ma
Zhenkai Wu
Rongrong Lian
Wei Zhang
Siyang Song
24
3
0
21 Jun 2024
Liveness Detection in Computer Vision: Transformer-based Self-Supervised Learning for Face Anti-Spoofing
Arman Keresh
Pakizar Shamoi
31
5
0
19 Jun 2024
Technique Report of CVPR 2024 PBDL Challenges
Ying Fu
Yu Li
Shaodi You
Boxin Shi
Linwei Chen
...
Songyin Dai
Sen Jia
Junpei Zhang
Puhua Chen
Qihang Li
33
0
0
15 Jun 2024
CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation
Zhongzhen Huang
Yankai Jiang
Rongzhao Zhang
Shaoting Zhang
Xiaofan Zhang
MedIm
62
4
0
11 Jun 2024
CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only
Junhee Cho
Jihoon Kim
Daseul Bae
Jinho Choo
Youngjune Gwon
Yeong-Dae Kwon
LLMAG
23
1
0
11 Jun 2024
ProMotion: Prototypes As Motion Learners
Yawen Lu
Dongfang Liu
Qifan Wang
Cheng Han
Yiming Cui
Zhiwen Cao
Xueling Zhang
Yingjie Victor Chen
Heng Fan
DiffM
30
2
0
07 Jun 2024
Frequency-based Matcher for Long-tailed Semantic Segmentation
Shan Li
Lu Yang
Pu Cao
Liulei Li
Huadong Ma
31
1
0
06 Jun 2024
Extreme Point Supervised Instance Segmentation
Hyeonjun Lee
S. Hwang
Suha Kwak
19
2
0
31 May 2024
BAISeg: Boundary Assisted Weakly Supervised Instance Segmentation
Tengbo Wang
Yu Bai
ISeg
37
0
0
27 May 2024
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Yachan Guo
Yi Xiao
Danna Xue
Jose Luis Gomez Zurita
Antonio M. López
55
0
0
15 May 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Tianci Bi
Xiaoyi Zhang
Zhizheng Zhang
Wenxuan Xie
Cuiling Lan
Yan Lu
Nanning Zheng
VLM
43
1
0
13 May 2024
Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation
Yingying Zhang
Chuangji Shi
Xin Guo
Jiangwei Lao
Jian Wang
Jiaotuan Wang
Jingdong Chen
32
2
0
06 May 2024
Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models
Mohamad Al Mdfaa
Raghad Salameh
Sergey Zagoruyko
Gonzalo Ferrer
19
0
0
03 May 2024
Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification
Siqi Yin
Lifan Jiang
22
0
0
03 May 2024
GraCo: Granularity-Controllable Interactive Segmentation
Yian Zhao
Kehan Li
Ze-Long Cheng
Pengchong Qiao
Xiawu Zheng
Rongrong Ji
Chang Liu
Li-ming Yuan
Jie Chen
31
9
0
01 May 2024
UniFS: Universal Few-shot Instance Perception with Point Representations
Sheng Jin
Ruijie Yao
Lumin Xu
Wentao Liu
Chao Qian
Ji Wu
Ping Luo
40
2
0
30 Apr 2024
UniRGB-IR: A Unified Framework for Visible-Infrared Semantic Tasks via Adapter Tuning
Maoxun Yuan
Bo Cui
Tianyi Zhao
Xingxing Wei
Shan Fu
Xue Yang
Xingxing Wei
35
0
0
26 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
51
0
0
23 Apr 2024
CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect
Minh Q. Tran
Sang Truong
Arthur F. A. Fernandes
Michael Kidd
Ngan Le
ViT
21
2
0
17 Apr 2024
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Zhongrui Gui
Shuyang Sun
Runjia Li
Jianhao Yuan
Zhaochong An
Karsten Roth
Ameya Prabhu
Philip H. S. Torr
VLM
CLL
24
6
0
15 Apr 2024
LaSagnA: Language-based Segmentation Assistant for Complex Queries
Cong Wei
Haoxian Tan
Yujie Zhong
Yujiu Yang
Lin Ma
34
14
0
12 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao-Yu Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
70
2
0
06 Apr 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu
Yirui Wang
K. Yan
Haoshen Li
Dazhou Guo
...
Na Shen
Qifeng Wang
Xiaowei Ding
X. Ye
Dakai Jin
MedIm
66
2
0
04 Apr 2024
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Duy-Tho Le
Chenhui Gou
Stavya Datta
Hengcan Shi
Ian Reid
Jianfei Cai
Hamid Rezatofighi
24
2
0
02 Apr 2024
Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation
Beomyoung Kim
Donghyeon Kim
Sung Ju Hwang
27
0
0
01 Apr 2024
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning
Beomyoung Kim
Joonsang Yu
Sung Ju Hwang
VLM
CLL
18
10
0
29 Mar 2024
Leveraging Large Language Model-based Room-Object Relationships Knowledge for Enhancing Multimodal-Input Object Goal Navigation
Leyuan Sun
Asako Kanezaki
Guillaume Caron
Yusuke Yoshiyasu
LM&Ro
21
2
0
21 Mar 2024
Video Object Segmentation with Dynamic Query Modulation
Hantao Zhou
Runze Hu
Xiu Li
VOS
38
1
0
18 Mar 2024
Endora: Video Generation Models as Endoscopy Simulators
Chenxin Li
Hengyu Liu
Yifan Liu
Brandon Yushan Feng
Wuyang Li
Xinyu Liu
Zhen Chen
Jing Shao
Yixuan Yuan
VGen
MedIm
80
33
0
17 Mar 2024
PosSAM: Panoptic Open-vocabulary Segment Anything
VS Vibashan
Shubhankar Borse
Hyojin Park
Debasmit Das
Vishal M. Patel
Munawar Hayat
Fatih Porikli
VLM
MLLM
31
6
0
14 Mar 2024
MOTPose: Multi-object 6D Pose Estimation for Dynamic Video Sequences using Attention-based Temporal Fusion
Arul Selvam Periyasamy
Sven Behnke
3DPC
22
0
0
14 Mar 2024
Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs
Nikhil Mishra
Maximilian Sieb
Pieter Abbeel
Xi Chen
3DPC
33
1
0
07 Mar 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
33
16
0
28 Feb 2024