2012.00759
Cited By
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
1 December 2020
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
Papers citing
"MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers"
50 / 119 papers shown
Title
Falcon: Fractional Alternating Cut with Overcoming Minima in Unsupervised Segmentation
Xiao Zhang
Xiangyu Han
Xiwen Lai
Yao Sun
Pei Zhang
Konrad Kording
34
0
0
08 Apr 2025
Hierarchical Vector Quantization for Unsupervised Action Segmentation
Federico Spurio
Emad Bahrami
Gianpiero Francesca
Juergen Gall
39
0
0
23 Dec 2024
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation
Lin Sun
Jiale Cao
J. Xie
F. Khan
Yanwei Pang
DiffM
37
1
0
05 Sep 2024
Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?
Chen Liang
Qiang Guo
Xiaochao Qu
Luoqi Liu
Ting Liu
VOS
34
0
0
20 Aug 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
46
0
0
12 Jul 2024
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
Daan de Geus
Gijs Dubbelman
29
0
0
14 Jun 2024
An Image is Worth 32 Tokens for Reconstruction and Generation
Qihang Yu
Mark Weber
XueQing Deng
Xiaohui Shen
Daniel Cremers
Liang-Chieh Chen
VLM
ViT
46
80
0
11 Jun 2024
Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Inkyu Shin
Qihang Yu
Xiaohui Shen
In So Kweon
Kuk-Jin Yoon
Liang-Chieh Chen
VGen
DiffM
66
1
0
04 Jun 2024
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
Elham Amin Mansour
Ozan Unal
Suman Saha
Benjamin Bejar
Luc Van Gool
36
1
0
04 Apr 2024
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Duy-Tho Le
Chenhui Gou
Stavya Datta
Hengcan Shi
Ian Reid
Jianfei Cai
Hamid Rezatofighi
27
2
0
02 Apr 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition
Yizheng Gong
Siyue Yu
Xiaoyang Wang
Jimin Xiao
CLL
25
5
0
06 Mar 2024
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Yiting Wang
Haonan Zhao
Daniel Gummadi
M. Dianati
Kurt Debattista
Valentina Donzella
26
2
0
23 Feb 2024
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke
Bert De Brabandere
DiffM
32
11
0
18 Jan 2024
Low-Resource Vision Challenges for Foundation Models
Yunhua Zhang
Hazel Doughty
Cees G. M. Snoek
VLM
22
5
0
09 Jan 2024
PolyMaX: General Dense Prediction with Mask Transformer
Xuan S. Yang
Liangzhe Yuan
Kimberly Wilber
Astuti Sharma
Xiuye Gu
...
Stephanie Debats
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Liang-Chieh Chen
26
14
0
09 Nov 2023
Improving Robustness for Vision Transformer with a Simple Dynamic Scanning Augmentation
Shashank Kotyan
Danilo Vasconcellos Vargas
ViT
22
2
0
01 Nov 2023
3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers
Jieneng Chen
Jieru Mei
Xianhang Li
Yongyi Lu
Qihang Yu
...
M. Lungren
Lei Xing
Le Lu
Alan L. Yuille
Yuyin Zhou
MedIm
ViT
33
36
0
11 Oct 2023
TCOVIS: Temporally Consistent Online Video Instance Segmentation
Junlong Li
Ting Yu
Yongming Rao
Jie Zhou
Jiwen Lu
30
12
0
21 Sep 2023
Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation
Zhaochong An
Guolei Sun
Zongwei Wu
Hao Tang
Luc Van Gool
VOS
23
4
0
14 Sep 2023
Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation
Michael Jungo
Beat Wolf
Andrii Maksai
C. Musat
Andreas Fischer
22
2
0
06 Sep 2023
RestoreFormer++: Towards Real-World Blind Face Restoration from Undegraded Key-Value Pairs
Zhouxia Wang
Jiawei Zhang
Tianshui Chen
Wenping Wang
Ping Luo
16
16
0
14 Aug 2023
Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient Network
K. Yan
Xiaoli Yin
Yingda Xia
Fakai Wang
Shu Wang
...
Xiaoyu Bai
Jingren Zhou
Ling Zhang
Le Lu
Yu Shi
MedIm
32
5
0
17 Jul 2023
Efficient Multi-Task Scene Analysis with RGB-D Transformers
Söhnke Benedikt Fischedick
Daniel Seichter
Robin M. Schmidt
Leonard Rabes
H. Groß
25
9
0
08 Jun 2023
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Xiuye Gu
Yin Cui
Jonathan Huang
Abdullah M. Rashwan
X. Yang
...
Golnaz Ghiasi
Weicheng Kuo
Huizhong Chen
Liang-Chieh Chen
David A. Ross
ISeg
26
26
0
02 Jun 2023
HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation
Jian Ding
Nan Xue
Guisong Xia
Bernt Schiele
Dengxin Dai
ViT
17
30
0
22 May 2023
Intra-Batch Supervision for Panoptic Segmentation on High-Resolution Images
Daan de Geus
Gijs Dubbelman
SSeg
19
6
0
17 Apr 2023
Sketch-based Video Object Localization
Sangmin Woo
So-Yeong Jeon
Jinyoung Park
Minji Son
Sumin Lee
Changick Kim
11
0
0
02 Apr 2023
You Only Segment Once: Towards Real-Time Panoptic Segmentation
Jie Hu
Linyan Huang
Tianhe Ren
Shengchuan Zhang
Rongrong Ji
Liujuan Cao
SSeg
44
54
0
26 Mar 2023
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR Perception
Zixiang Zhou
Dongqiangzi Ye
Weijia Chen
Yufei Xie
Yu Wang
Panqu Wang
H. Foroosh
29
10
0
21 Mar 2023
Graph Transformer GANs for Graph-Constrained House Generation
H. Tang
Zhenyu Zhang
Humphrey Shi
Bo-wen Li
Lin Shao
N. Sebe
Radu Timofte
Luc Van Gool
41
19
0
14 Mar 2023
LMSeg: Language-guided Multi-dataset Segmentation
Qiang-feng Zhou
Yuang Liu
Chaohui Yu
Jingliang Li
Zhibin Wang
Fan Wang
VLM
13
18
0
27 Feb 2023
CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans
Jieneng Chen
Yingda Xia
Jiawen Yao
K. Yan
Jianpeng Zhang
...
Xin Chen
Jingren Zhou
Alan Yuille
Zai-De Liu
Ling Zhang
ViT
MedIm
28
15
0
28 Jan 2023
Skeleton-based Action Recognition through Contrasting Two-Stream Spatial-Temporal Networks
Chen Pang
Xuequan Lu
Lei Lyu
28
20
0
27 Jan 2023
Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting
Kaiwen Zhang
Jialun Peng
Jingjing Fu
Dong Liu
ViT
19
8
0
24 Jan 2023
Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic Segmentation
S. D. Dao
Hengcan Shi
Dinh Q. Phung
Jianfei Cai
VLM
34
0
0
18 Jan 2023
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
Haowei Wang
Jiayi Ji
Yiyi Zhou
Yongjian Wu
Xiaoshuai Sun
25
15
0
09 Jan 2023
Semi-MAE: Masked Autoencoders for Semi-supervised Vision Transformers
Haojie Yu
Kangnian Zhao
Xiaoming Xu
ViT
25
1
0
04 Jan 2023
Ego-Only: Egocentric Action Detection without Exocentric Transferring
Huiyu Wang
Mitesh Singh
Lorenzo Torresani
EgoV
66
22
0
03 Jan 2023
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Xiangtai Li
Shilin Xu
Yibo Yang
Haobo Yuan
Guangliang Cheng
Yu Tong
Zhouchen Lin
Ming-Hsuan Yang
Dacheng Tao
ViT
37
21
0
03 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
82
31
0
02 Jan 2023
PanDepth: Joint Panoptic Segmentation and Depth Completion
J. Lagos
Esa Rahtu
3DPC
VLM
28
1
0
29 Dec 2022
LUMix: Improving Mixup by Better Modelling Label Uncertainty
Shuyang Sun
Jieneng Chen
Ruifei He
Alan Yuille
Philip H. S. Torr
Song Bai
UQCV
NoLa
13
5
0
29 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
24
326
0
10 Nov 2022
Instance Segmentation with Cross-Modal Consistency
A. Z. Zhu
Vincent Casser
R. Mahjourian
Henrik Kretzschmar
Soren Pirk
ISeg
11
1
0
14 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT
MoE
28
58
0
04 Oct 2022
A Review of Modern Approaches for Coronary Angiography Imaging Analysis
Maxim Y Popov
Temirgali Aimyshev
Eldar Ismailov
Ablay Bulegenov
S. Fazli
18
3
0
28 Sep 2022
PointScatter: Point Set Representation for Tubular Structure Extraction
Dong Wang
Zhao Zhang
Zi-Long Zhao
Yuhang Liu
Yihong Chen
Liwei Wang
3DPC
34
10
0
13 Sep 2022
Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation
Nadine Behrmann
S. Golestaneh
Zico Kolter
Juergen Gall
M. Noroozi
22
72
0
01 Sep 2022
Multiple Instance Neuroimage Transformer
Ayush Singla
Qingyu Zhao
Daniel K. Do
Yuyin Zhou
K. Pohl
Ehsan Adeli
ViT
MedIm
11
11
0
19 Aug 2022
Flow-Guided Transformer for Video Inpainting
Kaiwen Zhang
Jingjing Fu
Dong Liu
ViT
21
68
0
14 Aug 2022