ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.00759
  4. Cited By
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers

MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers

1 December 2020
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
    ViT
ArXivPDFHTML

Papers citing "MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers"

19 / 119 papers shown
Title
DeepLab2: A TensorFlow Library for Deep Labeling
DeepLab2: A TensorFlow Library for Deep Labeling
Mark Weber
Huiyu Wang
Siyuan Qiao
Jun Xie
Maxwell D. Collins
...
Laura Leal-Taixe
Alan Yuille
Florian Schroff
Hartwig Adam
Liang-Chieh Chen
VLM
25
45
0
17 Jun 2021
CAT: Cross Attention in Vision Transformer
CAT: Cross Attention in Vision Transformer
Hezheng Lin
Xingyi Cheng
Xiangyu Wu
Fan Yang
Dong Shen
Zhongyuan Wang
Qing Song
Wei Yuan
ViT
27
149
0
10 Jun 2021
Dual-stream Network for Visual Recognition
Dual-stream Network for Visual Recognition
Mingyuan Mao
Renrui Zhang
Honghui Zheng
Peng Gao
Teli Ma
Yan Peng
Errui Ding
Baochang Zhang
Shumin Han
ViT
18
63
0
31 May 2021
Instances as Queries
Instances as Queries
Yuxin Fang
Shusheng Yang
Xinggang Wang
Yu Li
Chen Fang
Ying Shan
Bin Feng
Wenyu Liu
ISeg
42
255
0
05 May 2021
Visual Saliency Transformer
Visual Saliency Transformer
Nian Liu
Ni Zhang
Kaiyuan Wan
Ling Shao
Junwei Han
ViT
253
352
0
25 Apr 2021
Multiscale Vision Transformers
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
19
1,221
0
22 Apr 2021
ViViT: A Video Vision Transformer
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
30
2,086
0
29 Mar 2021
Multi-Scale Vision Longformer: A New Vision Transformer for
  High-Resolution Image Encoding
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
ViT
23
328
0
29 Mar 2021
Scalable Vision Transformers with Hierarchical Pooling
Scalable Vision Transformers with Hierarchical Pooling
Zizheng Pan
Bohan Zhuang
Jing Liu
Haoyu He
Jianfei Cai
ViT
25
126
0
19 Mar 2021
TransFG: A Transformer Architecture for Fine-grained Recognition
TransFG: A Transformer Architecture for Fine-grained Recognition
Ju He
Jieneng Chen
Shuai Liu
Adam Kortylewski
Cheng Yang
Yutong Bai
Changhu Wang
ViT
33
375
0
14 Mar 2021
Perceiver: General Perception with Iterative Attention
Perceiver: General Perception with Iterative Attention
Andrew Jaegle
Felix Gimeno
Andrew Brock
Andrew Zisserman
Oriol Vinyals
João Carreira
VLM
ViT
MDE
48
973
0
04 Mar 2021
Panoramic Panoptic Segmentation: Towards Complete Surrounding
  Understanding via Unsupervised Contrastive Learning
Panoramic Panoptic Segmentation: Towards Complete Surrounding Understanding via Unsupervised Contrastive Learning
A. Jaus
Kailun Yang
Rainer Stiefelhagen
32
36
0
01 Mar 2021
Big Bird: Transformers for Longer Sequences
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
259
2,013
0
28 Jul 2020
Conditional Convolutions for Instance Segmentation
Conditional Convolutions for Instance Segmentation
Zhi Tian
Chunhua Shen
Hao Chen
ISeg
173
597
0
12 Mar 2020
SegSort: Segmentation by Discriminative Sorting of Segments
SegSort: Segmentation by Discriminative Sorting of Segments
Jyh-Jing Hwang
Stella X. Yu
Jianbo Shi
Maxwell D. Collins
Tien-Ju Yang
Xiao Zhang
Liang-Chieh Chen
180
148
0
15 Oct 2019
CBNet: A Novel Composite Backbone Network Architecture for Object
  Detection
CBNet: A Novel Composite Backbone Network Architecture for Object Detection
Yudong Liu
Yongtao Wang
Siwei Wang
Tingting Liang
Qijie Zhao
Zhi Tang
Haibin Ling
ObjD
207
244
0
09 Sep 2019
Wider or Deeper: Revisiting the ResNet Model for Visual Recognition
Wider or Deeper: Revisiting the ResNet Model for Visual Recognition
Zifeng Wu
Chunhua Shen
A. Hengel
SSeg
247
1,491
0
30 Nov 2016
Aggregated Residual Transformations for Deep Neural Networks
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
297
10,216
0
16 Nov 2016
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
216
7,923
0
17 Aug 2015
Previous
123