ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXivPDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,359 papers shown
Title
Representation Separation for Semantic Segmentation with Vision
  Transformers
Representation Separation for Semantic Segmentation with Vision Transformers
Yuanduo Hong
Huihui Pan
Weichao Sun
Xinghu Yu
Huijun Gao
ViT
19
5
0
28 Dec 2022
Reversible Column Networks
Reversible Column Networks
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
29
53
0
22 Dec 2022
Generalized Decoding for Pixel, Image, and Language
Generalized Decoding for Pixel, Image, and Language
Xueyan Zou
Zi-Yi Dou
Jianwei Yang
Zhe Gan
Linjie Li
...
Lu Yuan
Nanyun Peng
Lijuan Wang
Yong Jae Lee
Jianfeng Gao
VLM
MLLM
ObjD
13
240
0
21 Dec 2022
Weakly supervised training of universal visual concepts for multi-domain
  semantic segmentation
Weakly supervised training of universal visual concepts for multi-domain semantic segmentation
Petra Bevandić
Marin Orsic
Ivan Grubišić
Josip Saric
Sinisa Segvic
15
5
0
20 Dec 2022
Planning-oriented Autonomous Driving
Planning-oriented Autonomous Driving
Yi Hu
Jiazhi Yang
Li Chen
Keyu Li
Chonghao Sima
...
Xiaosong Jia
Qiang Liu
Jifeng Dai
Yu Qiao
Hongyang Li
33
587
0
20 Dec 2022
Panoptic Lifting for 3D Scene Understanding with Neural Fields
Panoptic Lifting for 3D Scene Understanding with Neural Fields
Yawar Siddiqui
Lorenzo Porzi
Samuel Rota Buló
Norman Muller
Matthias Nießner
Angela Dai
Peter Kontschieder
40
128
0
19 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
Rethinking Vision Transformers for MobileNet Size and Speed
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
23
157
0
15 Dec 2022
QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware
  Part-Level Query
QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query
Yabo Xiao
Kai Su
Xiaojuan Wang
Dongdong Yu
Lei Jin
Mingshu He
Zehuan Yuan
3DH
11
16
0
15 Dec 2022
One-Shot Domain Adaptive and Generalizable Semantic Segmentation with
  Class-Aware Cross-Domain Transformers
One-Shot Domain Adaptive and Generalizable Semantic Segmentation with Class-Aware Cross-Domain Transformers
R. Gong
Qin Wang
Dengxin Dai
Luc Van Gool
ViT
11
4
0
14 Dec 2022
Look Before You Match: Instance Understanding Matters in Video Object
  Segmentation
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Chuanxin Tang
Xiyang Dai
Yucheng Zhao
Yujia Xie
Lu Yuan
Yu-Gang Jiang
VOS
28
39
0
13 Dec 2022
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group
  Propagation
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Chenhongyi Yang
Jiarui Xu
Shalini De Mello
Elliot J. Crowley
X. Wang
ViT
30
21
0
13 Dec 2022
OAMixer: Object-aware Mixing Layer for Vision Transformers
OAMixer: Object-aware Mixing Layer for Vision Transformers
H. Kang
Sangwoo Mo
Jinwoo Shin
VLM
29
4
0
13 Dec 2022
Test-time Adaptation vs. Training-time Generalization: A Case Study in
  Human Instance Segmentation using Keypoints Estimation
Test-time Adaptation vs. Training-time Generalization: A Case Study in Human Instance Segmentation using Keypoints Estimation
K. Azarian
Debasmit Das
Hyojin Park
Fatih Porikli
3DH
OOD
6
3
0
12 Dec 2022
CamoFormer: Masked Separable Attention for Camouflaged Object Detection
CamoFormer: Masked Separable Attention for Camouflaged Object Detection
Bo Yin
Xuying Zhang
Qibin Hou
Bo Sun
Deng-Ping Fan
Luc Van Gool
18
51
0
10 Dec 2022
RCDT: Relational Remote Sensing Change Detection with Transformer
RCDT: Relational Remote Sensing Change Detection with Transformer
Kaixuan Lu
Xiao Huang
ViT
17
8
0
09 Dec 2022
Towards Accurate Ground Plane Normal Estimation from Ego-Motion
Towards Accurate Ground Plane Normal Estimation from Ego-Motion
Jiaxin Zhang
Wei Sui
Qian Zhang
Tao Chen
Cong Yang
19
5
0
08 Dec 2022
Latent Graph Representations for Critical View of Safety Assessment
Latent Graph Representations for Critical View of Safety Assessment
Aditya Murali
Deepak Alapatt
Pietro Mascagni
Armine Vardazaryan
Alain Garcia
Nariaki Okamoto
Didier Mutter
N. Padoy
MedIm
18
19
0
08 Dec 2022
iQuery: Instruments as Queries for Audio-Visual Sound Separation
iQuery: Instruments as Queries for Audio-Visual Sound Separation
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
16
26
0
07 Dec 2022
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
Ziqi Zhou
Bowen Zhang
Yinjie Lei
Lingqiao Liu
Yifan Liu
VLM
19
167
0
07 Dec 2022
Framework-agnostic Semantically-aware Global Reasoning for Segmentation
Framework-agnostic Semantically-aware Global Reasoning for Segmentation
Mir Rayat Imtiaz Hossain
Leonid Sigal
James J. Little
ViT
17
0
0
06 Dec 2022
IncepFormer: Efficient Inception Transformer with Pyramid Pooling for
  Semantic Segmentation
IncepFormer: Efficient Inception Transformer with Pyramid Pooling for Semantic Segmentation
Lihua Fu
Haoyue Tian
Xiang Zhai
Pan Gao
Xiaojiang Peng
ViT
22
9
0
06 Dec 2022
DiffusionInst: Diffusion Model for Instance Segmentation
DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu
Haoxing Chen
Zhuoer Xu
Jun Lan
Changhua Meng
Weiqiang Wang
DiffM
14
66
0
06 Dec 2022
Semantic-aware Message Broadcasting for Efficient Unsupervised Domain
  Adaptation
Semantic-aware Message Broadcasting for Efficient Unsupervised Domain Adaptation
Xin Li
Cuiling Lan
Guoqiang Wei
Zhibo Chen
25
4
0
06 Dec 2022
Images Speak in Images: A Generalist Painter for In-Context Visual
  Learning
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
VLM
MLLM
45
244
0
05 Dec 2022
Mask Matching Transformer for Few-Shot Segmentation
Mask Matching Transformer for Few-Shot Segmentation
Siyu Jiao
Gengwei Zhang
Shant Navasardyan
Ling-Hao Chen
Yao-Min Zhao
Yunchao Wei
Humphrey Shi
29
27
0
05 Dec 2022
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
Wentong Li
Wenyu Liu
Jianke Zhu
Miaomiao Cui
Risheng Yu
Xia Hua
Lei Zhang
ISeg
24
30
0
03 Dec 2022
3D Segmentation of Humans in Point Clouds with Synthetic Data
3D Segmentation of Humans in Point Clouds with Synthetic Data
Ayca Takmaz
Jonas Schult
Irem Kaftan
Mertcan Akccay
Bastian Leibe
R. Sumner
Francis Engelmann
Siyu Tang
3DH
14
23
0
01 Dec 2022
Superpoint Transformer for 3D Scene Instance Segmentation
Superpoint Transformer for 3D Scene Instance Segmentation
Jiahao Sun
Chunmei Qing
Junpeng Tan
Xiangmin Xu
3DPC
34
103
0
28 Nov 2022
SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image
  Understanding
SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding
F. Bastani
Piper Wolters
Ritwik Gupta
Joe Ferdinando
Aniruddha Kembhavi
19
98
0
28 Nov 2022
Multi-Modal Few-Shot Temporal Action Detection
Multi-Modal Few-Shot Temporal Action Detection
Sauradip Nag
Mengmeng Xu
Xiatian Zhu
Juan-Manuel Perez-Rua
Bernard Ghanem
Yi-Zhe Song
Tao Xiang
VLM
22
6
0
27 Nov 2022
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary
  Semantic Segmentation
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
Huaishao Luo
Junwei Bao
Youzheng Wu
Xiaodong He
Tianrui Li
VLM
24
144
0
27 Nov 2022
Prototype as Query for Few Shot Semantic Segmentation
Prototype as Query for Few Shot Semantic Segmentation
Leilei Cao
Yibo Guo
Ye Yuan
Qiangguo Jin
ViT
20
10
0
27 Nov 2022
From Forks to Forceps: A New Framework for Instance Segmentation of
  Surgical Instruments
From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments
Britty Baby
Daksh Thapar
Mustafa Chasmai
Tamajit Banerjee
Kunal Dargan
A. Suri
Subhashis Banerjee
Chetan Arora
16
26
0
26 Nov 2022
Rethinking Alignment and Uniformity in Unsupervised Image Semantic
  Segmentation
Rethinking Alignment and Uniformity in Unsupervised Image Semantic Segmentation
Daoan Zhang
Chenming Li
Haoquan Li
Wen-Fong Huang
Lingyun Huang
Jianguo Zhang
25
20
0
26 Nov 2022
RbA: Segmenting Unknown Regions Rejected by All
RbA: Segmenting Unknown Regions Rejected by All
Nazir Nayal
Mısra Yavuz
João F. Henriques
Fatma Guney
UQCV
19
46
0
25 Nov 2022
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation
Fabio Cermelli
Matthieu Cord
Arthur Douillard
CLL
VLM
16
19
0
25 Nov 2022
Aggregated Text Transformer for Scene Text Detection
Aggregated Text Transformer for Scene Text Detection
Zhao Zhou
Xiangcheng Du
Yingbin Zheng
Cheng Jin
ViT
25
1
0
25 Nov 2022
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Ya Lu
Yuqiao Chen
Nicholas Ruozzi
Yu Xiang
16
23
0
21 Nov 2022
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
Jiaru Jia
Ming-Yu Liu
Jiake Xie
Xin Chen
Hong Zhang
Xin Jiang
Aiqing Yang
22
0
0
21 Nov 2022
Castling-ViT: Compressing Self-Attention via Switching Towards
  Linear-Angular Attention at Vision Transformer Inference
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference
Haoran You
Yunyang Xiong
Xiaoliang Dai
Bichen Wu
Peizhao Zhang
Haoqi Fan
Peter Vajda
Yingyan Lin
27
31
0
18 Nov 2022
Delving into Transformer for Incremental Semantic Segmentation
Delving into Transformer for Incremental Semantic Segmentation
Zekai Xu
Mingying Zhang
Jiayue Hou
Xing Gong
Chuan Wen
Chengjie Wang
Junge Zhang
CLL
19
1
0
18 Nov 2022
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and
  Vision-Language Tasks
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Hao Li
Jinguo Zhu
Xiaohu Jiang
Xizhou Zhu
Hongsheng Li
...
Xiaohua Wang
Yu Qiao
Xiaogang Wang
Wenhai Wang
Jifeng Dai
MLLM
13
55
0
17 Nov 2022
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual
  Information
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information
Weijie Su
Xizhou Zhu
Chenxin Tao
Lewei Lu
Bin Li
Gao Huang
Yu Qiao
Xiaogang Wang
Jie Zhou
Jifeng Dai
31
41
0
17 Nov 2022
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained
  Object Detectors
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors
Yuang Zhang
Tiancai Wang
Xiangyu Zhang
VOT
20
127
0
17 Nov 2022
TrafficCAM: A Versatile Dataset for Traffic Flow Segmentation
TrafficCAM: A Versatile Dataset for Traffic Flow Segmentation
Zhongying Deng
Yanqi Chen
Lihao Liu
Shujun Wang
Rihuan Ke
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
6
3
0
17 Nov 2022
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation
Jiaheng Liu
Tong He
Honghui Yang
Rui Su
Jiayi Tian
Junran Wu
Hongcheng Guo
Ke Xu
Wanli Ouyang
ISeg
18
14
0
17 Nov 2022
Robust Online Video Instance Segmentation with Track Queries
Robust Online Video Instance Segmentation with Track Queries
Zitong Zhan
Daniel McKee
Svetlana Lazebnik
21
9
0
16 Nov 2022
A Generalized Framework for Video Instance Segmentation
A Generalized Framework for Video Instance Segmentation
Miran Heo
Sukjun Hwang
Jeongseok Hyun
Hanju Kim
Seoung Wug Oh
Joon-Young Lee
Seon Joo Kim
VLM
17
41
0
16 Nov 2022
A Unified Mutual Supervision Framework for Referring Expression
  Segmentation and Generation
A Unified Mutual Supervision Framework for Referring Expression Segmentation and Generation
Shijia Huang
Feng Li
Hao Zhang
Siyi Liu
Lei Zhang
Liwei Wang
22
5
0
15 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at
  Scale
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
56
673
0
14 Nov 2022
Previous
123...2425262728
Next