2112.01527
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"
48 / 1,648 papers shown
Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning
A. Jaus
Kailun Yang
Rainer Stiefelhagen
185
21
0
21 Jun 2022
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm
International Journal of Computer Vision (IJCV), 2022
Jiangning Zhang
Xiangtai Li
Yabiao Wang
Chengjie Wang
Jianlong Wu
Yong Liu
Dacheng Tao
ViT
261
46
0
19 Jun 2022
REVECA -- Rich Encoder-decoder framework for Video Event CAptioner
Jaehyuk Heo
YongGi Jeong
Sunwoo Kim
Jaehee Kim
Pilsung Kang
94
0
0
18 Jun 2022
Forecasting of depth and ego-motion with transformers and self-supervision
International Conference on Pattern Recognition (ICPR), 2022
Houssem-eddine Boulahbal
A. Voicila
Andrew I. Comport
ViT
MDE
162
3
0
15 Jun 2022
Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention
Quanzeng You
Jiang Wang
Peng Chu
Andre Abrantes
Zicheng Liu
VOS
101
1
0
14 Jun 2022
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs
Neural Information Processing Systems (NeurIPS), 2022
Jinguo Zhu
Xizhou Zhu
Wenhai Wang
Xiaohua Wang
Jiaming Song
Xiaogang Wang
Jifeng Dai
MoMe
MoE
253
82
0
09 Jun 2022
VITA: Video Instance Segmentation via Object Token Association
Neural Information Processing Systems (NeurIPS), 2022
Miran Heo
Sukjun Hwang
Seoung Wug Oh
Joon-Young Lee
Seon Joo Kim
VOS
232
122
0
09 Jun 2022
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Computer Vision and Pattern Recognition (CVPR), 2022
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
313
509
0
06 Jun 2022
EfficientFormer: Vision Transformers at MobileNet Speed
Neural Information Processing Systems (NeurIPS), 2022
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
573
500
0
02 Jun 2022
Differentiable Soft-Masked Attention
A. Athar
Jonathon Luiten
Alexander Hermans
Deva Ramanan
Bastian Leibe
83
0
0
01 Jun 2022
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction
Han Cai
Junyan Li
Muyan Hu
Chuang Gan
Song Han
281
80
0
29 May 2022
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Bruno Sauvalle
A. de La Fortelle
3DPC
234
12
0
26 May 2022
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
Neural Information Processing Systems (NeurIPS), 2022
Alexander Kolesnikov
André Susano Pinto
Lucas Beyer
Xiaohua Zhai
Jeremiah Harmsen
N. Houlsby
256
80
0
20 May 2022
HCFormer: Unified Image Segmentation with Hierarchical Clustering
Teppei Suzuki
249
1
0
20 May 2022
Vision Transformer Adapter for Dense Predictions
International Conference on Learning Representations (ICLR), 2022
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
739
738
0
17 May 2022
Transformer Scale Gate for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2022
Hengcan Shi
Munawar Hayat
Jianfei Cai
ViT
197
29
0
14 May 2022
Where in the World is this Image? Transformer-based Geo-localization in the Wild
European Conference on Computer Vision (ECCV), 2022
Shraman Pramanick
E. Nowara
Joshua Gleason
Carlos D. Castillo
Rama Chellappa
ViT
162
54
0
29 Apr 2022
Joint Forecasting of Panoptic Segmentations with Difference Attention
Computer Vision and Pattern Recognition (CVPR), 2022
Colin Graber
Cyril Jazra
Wenjie Luo
Liangyan Gui
Alex Schwing
AI4TS
115
4
0
14 Apr 2022
Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition
European Conference on Computer Vision (ECCV), 2022
Shilin Xu
Xiangtai Li
Jingbo Wang
Guangliang Cheng
Yunhai Tong
Dacheng Tao
ViT
229
27
0
10 Apr 2022
Learning Local and Global Temporal Contexts for Video Semantic Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Guolei Sun
Yun Liu
Henghui Ding
Min Wu
Luc Van Gool
241
36
0
07 Apr 2022
End-to-End Instance Edge Detection
Xueyan Zou
Haotian Liu
Yong Jae Lee
129
2
0
06 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
European Conference on Computer Vision (ECCV), 2022
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
351
339
0
04 Apr 2022
Dynamic Focus-aware Positional Queries for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2022
Haoyu He
Jianfei Cai
Zizheng Pan
Jing Liu
Jing Zhang
Dacheng Tao
Bohan Zhuang
164
24
0
04 Apr 2022
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation
IEEE Transactions on Image Processing (IEEE TIP), 2022
Zhenyu Li
Xuyang Wang
Xianming Liu
Junjun Jiang
MDE
256
251
0
03 Apr 2022
StructToken : Rethinking Semantic Segmentation with Structural Prior
Fangjian Lin
Zhanhao Liang
Miao Zheng
Junjun He
Kaibing Chen
Sheng Tian
420
62
0
23 Mar 2022
GOSS: Towards Generalized Open-set Semantic Segmentation
The Visual Computer (TVC), 2022
Jie Hong
Weihong Li
Junlin Han
Jiyang Zheng
Pengfei Fang
Mehrtash Harandi
L. Petersson
VLM
198
21
0
23 Mar 2022
Focal Modulation Networks
Neural Information Processing Systems (NeurIPS), 2022
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
277
368
0
22 Mar 2022
Test-time Adaptation with Slot-Centric Models
International Conference on Machine Learning (ICML), 2022
Mihir Prabhudesai
Anirudh Goyal
S. Paul
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gaurav Aggarwal
Thomas Kipf
Deepak Pathak
Katerina Fragkiadaki
TTA
289
11
0
21 Mar 2022
Active Token Mixer
AAAI Conference on Artificial Intelligence (AAAI), 2022
Guoqiang Wei
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
164
22
0
11 Mar 2022
NeRFocus: Neural Radiance Field for 3D Synthetic Defocus
Yinhuai Wang
Shu-Yi Yang
Yu Fei Hu
Jian Zhang
200
14
0
10 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Kailai Li
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
317
485
0
09 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
European Conference on Computer Vision (ECCV), 2022
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOS
VLM
268
15
0
08 Mar 2022
Instance Segmentation for Autonomous Log Grasping in Forestry Operations
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
Jean-Michel Fortin
Olivier Gamache
Vincent Grondin
F. Pomerleau
Philippe Giguère
206
31
0
03 Mar 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
Computer Vision and Pattern Recognition (CVPR), 2022
Feng Li
Hao Zhang
Shi-guang Liu
Jian Guo
L. Ni
Lei Zhang
ViT
549
924
0
02 Mar 2022
TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation
R. Liu
Kailun Yang
Alina Roitberg
Kailai Li
Kunyu Peng
Huayao Liu
Yaonan Wang
Rainer Stiefelhagen
ViT
188
53
0
27 Feb 2022
Visual Attention Network
Computational Visual Media (CVM), 2022
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViT
VLM
432
850
0
20 Feb 2022
Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings
International Journal of Computer Vision (IJCV), 2022
Wei Yin
Yifan Liu
Chunhua Shen
Baichuan Sun
Anton Van Den Hengel
VLM
245
11
0
04 Feb 2022
Attention-based Proposals Refinement for 3D Object Detection
Minh-Quan Dao
Elwan Héry
Vincent Frémont
3DPC
85
2
0
18 Jan 2022
Pyramid Fusion Transformer for Semantic Segmentation
IEEE Transactions on Multimedia (IEEE TMM), 2022
Zipeng Qin
Jianbo Liu
Xiaoling Zhang
Maoqing Tian
Aojun Zhou
Shuai Yi
Jiaming Song
ViT
387
27
0
11 Jan 2022
SeMask: Semantically Masked Transformers for Semantic Segmentation
Jitesh Jain
Anukriti Singh
Nikita Orlov
Zilong Huang
Jiachen Li
Steven Walton
Humphrey Shi
ViT
241
117
0
23 Dec 2021
Lite Vision Transformer with Enhanced Self-Attention
Computer Vision and Pattern Recognition (CVPR), 2021
Chenglin Yang
Yilin Wang
Jianming Zhang
Chentao Song
Zijun Wei
Zhe Lin
Alan Yuille
ViT
203
146
0
20 Dec 2021
Mask2Former for Video Instance Segmentation
Bowen Cheng
Anwesa Choudhuri
Ishan Misra
Alexander Kirillov
Rohit Girdhar
Alex Schwing
VOS
193
209
0
20 Dec 2021
Efficient Self-Ensemble for Semantic Segmentation
British Machine Vision Conference (BMVC), 2021
Walid Bousselham
Guillaume Thibault
Lucas Pagano
Archana Machireddy
Joe W. Gray
Y. Chang
Xubo B. Song
ViT
248
32
0
26 Nov 2021
Dense Prediction with Attentive Feature Aggregation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Yung-Hsu Yang
Thomas E. Huang
Min Sun
Samuel Rota Buló
Peter Kontschieder
Feng Yu
175
8
0
01 Nov 2021
Recent Advances of Continual Learning in Computer Vision: An Overview
IET Computer Vision (ICV), 2021
Haoxuan Qu
Hossein Rahmani
Kepeng Xu
Bryan M. Williams
Jun Liu
VLM
CLL
370
93
0
23 Sep 2021
ACDC: The Adverse Conditions Dataset with Correspondences for Robust Semantic Driving Scene Perception
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Daniel Gehrig
Haoran Wang
Ke Li
René Zurbrugg
Arpit Jadon
Wim Abbeloos
Daniel Olmeda Reino
Luc Van Gool
Dengxin Dai
258
11
0
27 Apr 2021
Quality-Aware Network for Human Parsing
IEEE Transactions on Multimedia (IEEE TMM), 2021
Pu Cao
Q. Song
Zhihui Wang
Zhiwei Liu
Songcen Xu
Zhihao Li
196
30
0
10 Mar 2021
Xception: Deep Learning with Depthwise Separable Convolutions
Computer Vision and Pattern Recognition (CVPR), 2016
François Chollet
MDE
BDL
PINN
2.5K
16,491
0
07 Oct 2016