ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
v1v2v3 (latest)

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXiv (abs)PDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

48 / 1,648 papers shown
Title
Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for
  Mobile Agents via Unsupervised Contrastive Learning
Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning
A. Jaus
Kailun Yang
Rainer Stiefelhagen
185
21
0
21 Jun 2022
EATFormer: Improving Vision Transformer Inspired by Evolutionary
  Algorithm
EATFormer: Improving Vision Transformer Inspired by Evolutionary AlgorithmInternational Journal of Computer Vision (IJCV), 2022
Jiangning Zhang
Xiangtai Li
Yabiao Wang
Chengjie Wang
Jianlong Wu
Yong Liu
Dacheng Tao
ViT
261
46
0
19 Jun 2022
REVECA -- Rich Encoder-decoder framework for Video Event CAptioner
REVECA -- Rich Encoder-decoder framework for Video Event CAptioner
Jaehyuk Heo
YongGi Jeong
Sunwoo Kim
Jaehee Kim
Pilsung Kang
94
0
0
18 Jun 2022
Forecasting of depth and ego-motion with transformers and
  self-supervision
Forecasting of depth and ego-motion with transformers and self-supervisionInternational Conference on Pattern Recognition (ICPR), 2022
Houssem-eddine Boulahbal
A. Voicila
Andrew I. Comport
ViTMDE
162
3
0
15 Jun 2022
Consistent Video Instance Segmentation with Inter-Frame Recurrent
  Attention
Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention
Quanzeng You
Jiang Wang
Peng Chu
Andre Abrantes
Zicheng Liu
VOS
101
1
0
14 Jun 2022
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional
  MoEs
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEsNeural Information Processing Systems (NeurIPS), 2022
Jinguo Zhu
Xizhou Zhu
Wenhai Wang
Xiaohua Wang
Jiaming Song
Xiaogang Wang
Jifeng Dai
MoMeMoE
253
82
0
09 Jun 2022
VITA: Video Instance Segmentation via Object Token Association
VITA: Video Instance Segmentation via Object Token AssociationNeural Information Processing Systems (NeurIPS), 2022
Miran Heo
Sukjun Hwang
Seoung Wug Oh
Joon-Young Lee
Seon Joo Kim
VOS
232
122
0
09 Jun 2022
Mask DINO: Towards A Unified Transformer-based Framework for Object
  Detection and Segmentation
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and SegmentationComputer Vision and Pattern Recognition (CVPR), 2022
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
313
509
0
06 Jun 2022
EfficientFormer: Vision Transformers at MobileNet Speed
EfficientFormer: Vision Transformers at MobileNet SpeedNeural Information Processing Systems (NeurIPS), 2022
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
573
500
0
02 Jun 2022
Differentiable Soft-Masked Attention
Differentiable Soft-Masked Attention
A. Athar
Jonathon Luiten
Alexander Hermans
Deva Ramanan
Bastian Leibe
83
0
0
01 Jun 2022
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense
  Prediction
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction
Han Cai
Junyan Li
Muyan Hu
Chuang Gan
Song Han
281
80
0
29 May 2022
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
Unsupervised Multi-object Segmentation Using Attention and Soft-argmaxIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Bruno Sauvalle
A. de La Fortelle
3DPC
234
12
0
26 May 2022
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
UViM: A Unified Modeling Approach for Vision with Learned Guiding CodesNeural Information Processing Systems (NeurIPS), 2022
Alexander Kolesnikov
André Susano Pinto
Lucas Beyer
Xiaohua Zhai
Jeremiah Harmsen
N. Houlsby
256
80
0
20 May 2022
HCFormer: Unified Image Segmentation with Hierarchical Clustering
HCFormer: Unified Image Segmentation with Hierarchical Clustering
Teppei Suzuki
249
1
0
20 May 2022
Vision Transformer Adapter for Dense Predictions
Vision Transformer Adapter for Dense PredictionsInternational Conference on Learning Representations (ICLR), 2022
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
739
738
0
17 May 2022
Transformer Scale Gate for Semantic Segmentation
Transformer Scale Gate for Semantic SegmentationComputer Vision and Pattern Recognition (CVPR), 2022
Hengcan Shi
Munawar Hayat
Jianfei Cai
ViT
197
29
0
14 May 2022
Where in the World is this Image? Transformer-based Geo-localization in
  the Wild
Where in the World is this Image? Transformer-based Geo-localization in the WildEuropean Conference on Computer Vision (ECCV), 2022
Shraman Pramanick
E. Nowara
Joshua Gleason
Carlos D. Castillo
Rama Chellappa
ViT
162
54
0
29 Apr 2022
Joint Forecasting of Panoptic Segmentations with Difference Attention
Joint Forecasting of Panoptic Segmentations with Difference AttentionComputer Vision and Pattern Recognition (CVPR), 2022
Colin Graber
Cyril Jazra
Wenjie Luo
Liangyan Gui
Alex Schwing
AI4TS
115
4
0
14 Apr 2022
Fashionformer: A simple, Effective and Unified Baseline for Human
  Fashion Segmentation and Recognition
Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and RecognitionEuropean Conference on Computer Vision (ECCV), 2022
Shilin Xu
Xiangtai Li
Jingbo Wang
Guangliang Cheng
Yunhai Tong
Dacheng Tao
ViT
229
27
0
10 Apr 2022
Learning Local and Global Temporal Contexts for Video Semantic
  Segmentation
Learning Local and Global Temporal Contexts for Video Semantic SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Guolei Sun
Yun Liu
Henghui Ding
Min Wu
Luc Van Gool
241
36
0
07 Apr 2022
End-to-End Instance Edge Detection
End-to-End Instance Edge Detection
Xueyan Zou
Haotian Liu
Yong Jae Lee
129
2
0
06 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
MultiMAE: Multi-modal Multi-task Masked AutoencodersEuropean Conference on Computer Vision (ECCV), 2022
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
351
339
0
04 Apr 2022
Dynamic Focus-aware Positional Queries for Semantic Segmentation
Dynamic Focus-aware Positional Queries for Semantic SegmentationComputer Vision and Pattern Recognition (CVPR), 2022
Haoyu He
Jianfei Cai
Zizheng Pan
Jing Liu
Jing Zhang
Dacheng Tao
Bohan Zhuang
164
24
0
04 Apr 2022
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation
BinsFormer: Revisiting Adaptive Bins for Monocular Depth EstimationIEEE Transactions on Image Processing (IEEE TIP), 2022
Zhenyu Li
Xuyang Wang
Xianming Liu
Junjun Jiang
MDE
256
251
0
03 Apr 2022
StructToken : Rethinking Semantic Segmentation with Structural Prior
StructToken : Rethinking Semantic Segmentation with Structural Prior
Fangjian Lin
Zhanhao Liang
Miao Zheng
Junjun He
Kaibing Chen
Sheng Tian
420
62
0
23 Mar 2022
GOSS: Towards Generalized Open-set Semantic Segmentation
GOSS: Towards Generalized Open-set Semantic SegmentationThe Visual Computer (TVC), 2022
Jie Hong
Weihong Li
Junlin Han
Jiyang Zheng
Pengfei Fang
Mehrtash Harandi
L. Petersson
VLM
198
21
0
23 Mar 2022
Focal Modulation Networks
Focal Modulation NetworksNeural Information Processing Systems (NeurIPS), 2022
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
277
368
0
22 Mar 2022
Test-time Adaptation with Slot-Centric Models
Test-time Adaptation with Slot-Centric ModelsInternational Conference on Machine Learning (ICML), 2022
Mihir Prabhudesai
Anirudh Goyal
S. Paul
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gaurav Aggarwal
Thomas Kipf
Deepak Pathak
Katerina Fragkiadaki
TTA
289
11
0
21 Mar 2022
Active Token Mixer
Active Token MixerAAAI Conference on Artificial Intelligence (AAAI), 2022
Guoqiang Wei
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
164
22
0
11 Mar 2022
NeRFocus: Neural Radiance Field for 3D Synthetic Defocus
NeRFocus: Neural Radiance Field for 3D Synthetic Defocus
Yinhuai Wang
Shu-Yi Yang
Yu Fei Hu
Jian Zhang
200
14
0
10 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with
  Transformers
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Kailai Li
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
317
485
0
09 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for
  Segmentation
RankSeg: Adaptive Pixel Classification with Image Category Ranking for SegmentationEuropean Conference on Computer Vision (ECCV), 2022
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOSVLM
268
15
0
08 Mar 2022
Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Instance Segmentation for Autonomous Log Grasping in Forestry OperationsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Jean-Michel Fortin
Olivier Gamache
Vincent Grondin
F. Pomerleau
Philippe Giguère
206
31
0
03 Mar 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
DN-DETR: Accelerate DETR Training by Introducing Query DeNoisingComputer Vision and Pattern Recognition (CVPR), 2022
Feng Li
Hao Zhang
Shi-guang Liu
Jian Guo
L. Ni
Lei Zhang
ViT
549
924
0
02 Mar 2022
TransKD: Transformer Knowledge Distillation for Efficient Semantic
  Segmentation
TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation
R. Liu
Kailun Yang
Alina Roitberg
Kailai Li
Kunyu Peng
Huayao Liu
Yaonan Wang
Rainer Stiefelhagen
ViT
188
53
0
27 Feb 2022
Visual Attention Network
Visual Attention NetworkComputational Visual Media (CVM), 2022
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViTVLM
432
850
0
20 Feb 2022
Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings
Scaling up Multi-domain Semantic Segmentation with Sentence EmbeddingsInternational Journal of Computer Vision (IJCV), 2022
Wei Yin
Yifan Liu
Chunhua Shen
Baichuan Sun
Anton Van Den Hengel
VLM
245
11
0
04 Feb 2022
Attention-based Proposals Refinement for 3D Object Detection
Attention-based Proposals Refinement for 3D Object Detection
Minh-Quan Dao
Elwan Héry
Vincent Frémont
3DPC
85
2
0
18 Jan 2022
Pyramid Fusion Transformer for Semantic Segmentation
Pyramid Fusion Transformer for Semantic SegmentationIEEE transactions on multimedia (IEEE TMM), 2022
Zipeng Qin
Jianbo Liu
Xiaoling Zhang
Maoqing Tian
Aojun Zhou
Shuai Yi
Jiaming Song
ViT
387
27
0
11 Jan 2022
SeMask: Semantically Masked Transformers for Semantic Segmentation
SeMask: Semantically Masked Transformers for Semantic Segmentation
Jitesh Jain
Anukriti Singh
Nikita Orlov
Zilong Huang
Jiachen Li
Steven Walton
Humphrey Shi
ViT
241
117
0
23 Dec 2021
Lite Vision Transformer with Enhanced Self-Attention
Lite Vision Transformer with Enhanced Self-AttentionComputer Vision and Pattern Recognition (CVPR), 2021
Chenglin Yang
Yilin Wang
Jianming Zhang
Chentao Song
Zijun Wei
Zhe Lin
Alan Yuille
ViT
203
146
0
20 Dec 2021
Mask2Former for Video Instance Segmentation
Mask2Former for Video Instance Segmentation
Bowen Cheng
Anwesa Choudhuri
Ishan Misra
Alexander Kirillov
Rohit Girdhar
Alex Schwing
VOS
193
209
0
20 Dec 2021
Efficient Self-Ensemble for Semantic Segmentation
Efficient Self-Ensemble for Semantic SegmentationBritish Machine Vision Conference (BMVC), 2021
Walid Bousselham
Guillaume Thibault
Lucas Pagano
Archana Machireddy
Joe W. Gray
Y. Chang
Xubo B. Song
ViT
248
32
0
26 Nov 2021
Dense Prediction with Attentive Feature Aggregation
Dense Prediction with Attentive Feature AggregationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Yung-Hsu Yang
Thomas E. Huang
Min Sun
Samuel Rota Buló
Peter Kontschieder
Feng Yu
175
8
0
01 Nov 2021
Recent Advances of Continual Learning in Computer Vision: An Overview
Recent Advances of Continual Learning in Computer Vision: An OverviewIET Computer Vision (ICV), 2021
Haoxuan Qu
Hossein Rahmani
Kepeng Xu
Bryan M. Williams
Jun Liu
VLMCLL
370
93
0
23 Sep 2021
ACDC: The Adverse Conditions Dataset with Correspondences for Robust Semantic Driving Scene Perception
ACDC: The Adverse Conditions Dataset with Correspondences for Robust Semantic Driving Scene PerceptionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Daniel Gehrig
Haoran Wang
Ke Li
René Zurbrugg
Arpit Jadon
Wim Abbeloos
Daniel Olmeda Reino
Luc Van Gool
Dengxin Dai
258
11
0
27 Apr 2021
Quality-Aware Network for Human Parsing
Quality-Aware Network for Human ParsingIEEE transactions on multimedia (IEEE Trans. Multimedia), 2021
Pu Cao
Q. Song
Zhihui Wang
Zhiwei Liu
Songcen Xu
Zhihao Li
196
30
0
10 Mar 2021
Xception: Deep Learning with Depthwise Separable Convolutions
Xception: Deep Learning with Depthwise Separable ConvolutionsComputer Vision and Pattern Recognition (CVPR), 2016
François Chollet
MDEBDLPINN
2.5K
16,491
0
07 Oct 2016
Previous
123...313233