MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers

1 December 2020
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
    ViT

Papers citing "MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers"

50 / 119 papers shown
Title
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Zihan Ding
Zixiang Ding
Tianrui Hui
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Si Liu
12
12
0
11 Aug 2022
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
Chaoqiang Zhao
Youming Zhang
Matteo Poggi
Fabio Tosi
Xianda Guo
Zheng Zhu
Guan Huang
Yang Tang
S. Mattoccia
ViT
MDE
29
174
0
06 Aug 2022
Transformer with Implicit Edges for Particle-based Physics Simulation
Yidi Shao
Chen Change Loy
Bo Dai
16
15
0
22 Jul 2022
Weakly Supervised Video Salient Object Detection via Point Supervision
Shuyong Gao
Hao Xing
Wei Zhang
Yan Wang
Qianyu Guo
Wenqiang Zhang
23
24
0
15 Jul 2022
Online Video Instance Segmentation via Robust Context Fusion
Xiang Li
Jinglu Wang
Xiaohao Xu
Bhiksha Raj
Yan Lu
35
5
0
12 Jul 2022
SFNet: Faster and Accurate Semantic Segmentation via Semantic Flow
Xiangtai Li
Jiangning Zhang
Yibo Yang
Guangliang Cheng
Kuiyuan Yang
Yu Tong
Dacheng Tao
SSeg
AI4TS
40
28
0
10 Jul 2022
Dual Decision Improves Open-Set Panoptic Segmentation
Hainan Xu
Hao Chen
Lingqiao Liu
Yufei Yin
VLM
19
6
0
06 Jul 2022
OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers
Jialun Pei
Tianyang Cheng
Deng-Ping Fan
He Tang
Chuanbo Chen
Luc Van Gool
ViT
18
54
0
05 Jul 2022
MGNet: Monocular Geometric Scene Understanding for Autonomous Driving
Markus Schön
M. Buchholz
Klaus C. J. Dietmayer
3DPC
3DGS
28
40
0
27 Jun 2022
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Qihang Yu
Huiyu Wang
Dahun Kim
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
MedIm
32
89
0
17 Jun 2022
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
42
366
0
06 Jun 2022
Decoder Denoising Pretraining for Semantic Segmentation
Emmanuel B. Asiedu
Simon Kornblith
Ting Chen
Niki Parmar
Matthias Minderer
Mohammad Norouzi
AI4CE
193
26
0
23 May 2022
Siamese Object Tracking for Unmanned Aerial Vehicle: A Review and Comprehensive Analysis
Changhong Fu
Kunhan Lu
Guang-Zheng Zheng
Junjie Ye
Ziang Cao
Bowen Li
Geng Lu
19
55
0
09 May 2022
Joint Forecasting of Panoptic Segmentations with Difference Attention
Colin Graber
Cyril Jazra
Wenjie Luo
Liangyan Gui
A. Schwing
AI4TS
24
1
0
14 Apr 2022
Panoptic, Instance and Semantic Relations: A Relational Context Encoder to Enhance Panoptic Segmentation
Shubhankar Borse
Hyojin Park
H. Cai
Debasmit Das
Risheek Garrepalli
Fatih Porikli
ISeg
33
13
0
11 Apr 2022
Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition
Shilin Xu
Xiangtai Li
Jingbo Wang
Guangliang Cheng
Yunhai Tong
Dacheng Tao
ViT
23
27
0
10 Apr 2022
Learning Local and Global Temporal Contexts for Video Semantic Segmentation
Guolei Sun
Yun Liu
Henghui Ding
Min Wu
Luc Van Gool
25
32
0
07 Apr 2022
ReSTR: Convolution-free Referring Image Segmentation Using Transformers
N. Kim
Dongwon Kim
Cuiling Lan
Wenjun Zeng
Suha Kwak
19
136
0
31 Mar 2022
SepViT: Separable Vision Transformer
Wei Li
Xing Wang
Xin Xia
Jie Wu
Jiashi Li
Xuefeng Xiao
Min Zheng
Shiping Wen
ViT
26
39
0
29 Mar 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
F. Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViT
MedIm
24
28
0
24 Mar 2022
Sparse Instance Activation for Real-Time Instance Segmentation
Tianheng Cheng
Xinggang Wang
Shaoyu Chen
Wenqiang Zhang
Qian Zhang
Chang Huang
Zhaoxiang Zhang
Wenyu Liu
ISeg
27
125
0
24 Mar 2022
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
22
263
0
22 Mar 2022
The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy
Tianlong Chen
Zhenyu (Allen) Zhang
Yu Cheng
Ahmed Hassan Awadallah
Zhangyang Wang
ViT
33
37
0
12 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
21
296
0
09 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOS
VLM
19
13
0
08 Mar 2022
LwPosr: Lightweight Efficient Fine-Grained Head Pose Estimation
Naina Dhingra
16
16
0
07 Feb 2022
SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering
Peixi Xiong
Quanzeng You
Pei Yu
Zicheng Liu
Ying Wu
16
5
0
25 Jan 2022
RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs
Zhouxia Wang
Jiawei Zhang
Runjian Chen
Wenping Wang
Ping Luo
CVBM
13
109
0
17 Jan 2022
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels
Golnaz Ghiasi
Xiuye Gu
Yin Cui
Tsung-Yi Lin
VLM
30
370
0
22 Dec 2021
MPViT: Multi-Path Vision Transformer for Dense Prediction
Youngwan Lee
Jonghee Kim
Jeffrey Willette
Sung Ju Hwang
ViT
29
244
0
21 Dec 2021
Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation
Yi Zhou
Hui Zhang
Hana Lee
Shuyang Sun
Pingjun Li
Yangguang Zhu
ByungIn Yoo
Xiaojuan Qi
Jae-Joon Han
VOS
25
26
0
16 Dec 2021
QAHOI: Query-Based Anchors for Human-Object Interaction Detection
Junwen Chen
Keiji Yanai
24
40
0
16 Dec 2021
DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
Yuxuan Liang
Pan Zhou
Roger Zimmermann
Shuicheng Yan
ViT
23
21
0
09 Dec 2021
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
Haobo Yuan
Xiangtai Li
Yibo Yang
Guangliang Cheng
Jing Zhang
Yunhai Tong
Lefei Zhang
Dacheng Tao
MDE
35
42
0
05 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
82
2,265
0
02 Dec 2021
End-to-End Referring Video Object Segmentation with Multimodal Transformers
Adam Botach
Evgenii Zheltonozhskii
Chaim Baskin
VOS
17
140
0
29 Nov 2021
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Xumin Yu
Lulu Tang
Yongming Rao
Tiejun Huang
Jie Zhou
Jiwen Lu
3DPC
27
652
0
29 Nov 2021
Pruning Self-attentions into Convolutional Layers in Single Path
Haoyu He
Jianfei Cai
Jing Liu
Zizheng Pan
Jing Zhang
Dacheng Tao
Bohan Zhuang
ViT
29
40
0
23 Nov 2021
DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion
Renrui Zhang
Ziyao Zeng
Ziyu Guo
Xing Gao
Kexue Fu
Jianbo Shi
3DPC
22
26
0
19 Nov 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
71
330
0
11 Nov 2021
Video Instance Segmentation by Instance Flow Assembly
Xiang Li
Jinglu Wang
Xiao Li
Yan Lu
VOS
19
15
0
20 Oct 2021
ASFormer: Transformer for Action Segmentation
Fangqiu Yi
Hongyu Wen
Tingting Jiang
ViT
71
172
0
16 Oct 2021
Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance
Jiaming Zhang
Kailun Yang
Angela Constantinescu
Kunyu Peng
Karin Muller
Rainer Stiefelhagen
ViT
30
69
0
20 Aug 2021
Image Fusion Transformer
VS Vibashan
Jeya Maria Jose Valanarasu
Poojan Oza
Vishal M. Patel
ViT
30
116
0
19 Jul 2021
Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real World
Jiaming Zhang
Kailun Yang
Angela Constantinescu
Kunyu Peng
Karin Muller
Rainer Stiefelhagen
ViT
36
61
0
07 Jul 2021
Focal Self-attention for Local-Global Interactions in Vision Transformers
Jianwei Yang
Chunyuan Li
Pengchuan Zhang
Xiyang Dai
Bin Xiao
Lu Yuan
Jianfeng Gao
ViT
42
428
0
01 Jul 2021
Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images
L. Ding
Dong Lin
Shaofu Lin
Jing Zhang
Xiaojie Cui
Yuebin Wang
H. Tang
Lorenzo Bruzzone
ViT
21
97
0
29 Jun 2021
K-Net: Towards Unified Image Segmentation
Wenwei Zhang
Jiangmiao Pang
Kai-xiang Chen
Chen Change Loy
ISeg
32
356
0
28 Jun 2021
P2T: Pyramid Pooling Transformer for Scene Understanding
Yu-Huan Wu
Yun-Hai Liu
Xin Zhan
Ming-Ming Cheng
ViT
24
219
0
22 Jun 2021
Efficient Self-supervised Vision Transformers for Representation Learning
Chunyuan Li
Jianwei Yang
Pengchuan Zhang
Mei Gao
Bin Xiao
Xiyang Dai
Lu Yuan
Jianfeng Gao
ViT
30
209
0
17 Jun 2021