Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2112.01527
Cited By
v1
v2
v3 (latest)
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,661 papers shown
Title
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
258
32
0
02 Jan 2023
Deep Learning Technique for Human Parsing: A Survey and Outlook
International Journal of Computer Vision (IJCV), 2023
Pu Cao
Wenhe Jia
Shane Li
Q. Song
ViT
299
31
0
01 Jan 2023
PanDepth: Joint Panoptic Segmentation and Depth Completion
VISIGRAPP (VISIGRAPP), 2022
J. Lagos
Esa Rahtu
3DPC
VLM
263
2
0
29 Dec 2022
Representation Separation for Semantic Segmentation with Vision Transformers
Yuanduo Hong
Huihui Pan
Weichao Sun
Xinghu Yu
Huijun Gao
ViT
160
5
0
28 Dec 2022
Reversible Column Networks
International Conference on Learning Representations (ICLR), 2022
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
242
85
0
22 Dec 2022
Generalized Decoding for Pixel, Image, and Language
Computer Vision and Pattern Recognition (CVPR), 2022
Xueyan Zou
Zi-Yi Dou
Jianwei Yang
Zhe Gan
Linjie Li
...
Lu Yuan
Nanyun Peng
Lijuan Wang
Yong Jae Lee
Jianfeng Gao
VLM
MLLM
ObjD
264
324
0
21 Dec 2022
Weakly supervised training of universal visual concepts for multi-domain semantic segmentation
International Journal of Computer Vision (IJCV), 2022
Petra Bevandić
Marin Orsic
Ivan Grubišić
Josip Saric
Sinisa Segvic
366
6
0
20 Dec 2022
Planning-oriented Autonomous Driving
Computer Vision and Pattern Recognition (CVPR), 2022
Yi Hu
Jiazhi Yang
Li Chen
Keyu Li
Chonghao Sima
...
Xiaosong Jia
Qiang Liu
Jifeng Dai
Yu Qiao
Hongyang Li
190
999
0
20 Dec 2022
Panoptic Lifting for 3D Scene Understanding with Neural Fields
Computer Vision and Pattern Recognition (CVPR), 2022
Yawar Siddiqui
Lorenzo Porzi
Samuel Rota Buló
Norman Muller
Matthias Nießner
Angela Dai
Peter Kontschieder
191
179
0
19 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
IEEE International Conference on Computer Vision (ICCV), 2022
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
322
248
0
15 Dec 2022
QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query
Neural Information Processing Systems (NeurIPS), 2022
Yabo Xiao
Kai Su
Xiaojuan Wang
Dongdong Yu
Lei Jin
Mingshu He
Zehuan Yuan
3DH
183
27
0
15 Dec 2022
One-Shot Domain Adaptive and Generalizable Semantic Segmentation with Class-Aware Cross-Domain Transformers
R. Gong
Qin Wang
Dengxin Dai
Luc Van Gool
ViT
144
6
0
14 Dec 2022
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Computer Vision and Pattern Recognition (CVPR), 2022
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Chuanxin Tang
Xiyang Dai
Yucheng Zhao
Yujia Xie
Lu Yuan
Yu-Gang Jiang
VOS
172
52
0
13 Dec 2022
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
International Conference on Learning Representations (ICLR), 2022
Chenhongyi Yang
Jiarui Xu
Shalini De Mello
Elliot J. Crowley
Xinyu Wang
ViT
238
31
0
13 Dec 2022
OAMixer: Object-aware Mixing Layer for Vision Transformers
H. Kang
Sangwoo Mo
Jinwoo Shin
VLM
224
5
0
13 Dec 2022
Test-time Adaptation vs. Training-time Generalization: A Case Study in Human Instance Segmentation using Keypoints Estimation
K. Azarian
Debasmit Das
Hyojin Park
Fatih Porikli
3DH
OOD
277
3
0
12 Dec 2022
CamoFormer: Masked Separable Attention for Camouflaged Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Bo Yin
Xuying Zhang
Qibin Hou
Bo Sun
Deng-Ping Fan
Luc Van Gool
194
106
0
10 Dec 2022
RCDT: Relational Remote Sensing Change Detection with Transformer
Kaixuan Lu
Xiao Huang
ViT
123
9
0
09 Dec 2022
Towards Accurate Ground Plane Normal Estimation from Ego-Motion
Italian National Conference on Sensors (INS), 2022
Jiaxin Zhang
Wei Sui
Qian Zhang
Tao Chen
Cong Yang
133
5
0
08 Dec 2022
Latent Graph Representations for Critical View of Safety Assessment
IEEE Transactions on Medical Imaging (IEEE TMI), 2022
Aditya Murali
Deepak Alapatt
Pietro Mascagni
Armine Vardazaryan
Alain Garcia
Nariaki Okamoto
Didier Mutter
N. Padoy
MedIm
331
41
0
08 Dec 2022
iQuery: Instruments as Queries for Audio-Visual Sound Separation
Computer Vision and Pattern Recognition (CVPR), 2022
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
259
39
0
07 Dec 2022
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2022
Ziqi Zhou
Bowen Zhang
Yinjie Lei
Lingqiao Liu
Yifan Liu
VLM
251
239
0
07 Dec 2022
Framework-agnostic Semantically-aware Global Reasoning for Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Mir Rayat Imtiaz Hossain
Leonid Sigal
James J. Little
ViT
135
0
0
06 Dec 2022
IncepFormer: Efficient Inception Transformer with Pyramid Pooling for Semantic Segmentation
Lihua Fu
Haoyue Tian
Xiang Zhai
Pan Gao
Xiaojiang Peng
ViT
115
14
0
06 Dec 2022
DiffusionInst: Diffusion Model for Instance Segmentation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhangxuan Gu
Haoxing Chen
Zhuoer Xu
Jun Lan
Changhua Meng
Weiqiang Wang
DiffM
203
108
0
06 Dec 2022
Semantic-aware Message Broadcasting for Efficient Unsupervised Domain Adaptation
IEEE Transactions on Image Processing (IEEE TIP), 2022
Xin Li
Cuiling Lan
Guoqiang Wei
Zhibo Chen
171
6
0
06 Dec 2022
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
VLM
MLLM
289
325
0
05 Dec 2022
Mask Matching Transformer for Few-Shot Segmentation
Neural Information Processing Systems (NeurIPS), 2022
Siyu Jiao
Gengwei Zhang
Shant Navasardyan
Ling-Hao Chen
Yao-Min Zhao
Yunchao Wei
Humphrey Shi
168
42
0
05 Dec 2022
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Wentong Li
Wenyu Liu
Jianke Zhu
Miaomiao Cui
Risheng Yu
Xia Hua
Lei Zhang
ISeg
305
53
0
03 Dec 2022
3D Segmentation of Humans in Point Clouds with Synthetic Data
IEEE International Conference on Computer Vision (ICCV), 2022
Ayca Takmaz
Jonas Schult
Irem Kaftan
Mertcan Akccay
Bastian Leibe
R. Sumner
Francis Engelmann
Siyu Tang
3DH
302
29
0
01 Dec 2022
Superpoint Transformer for 3D Scene Instance Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Jiahao Sun
Chunmei Qing
Junpeng Tan
Xiangmin Xu
3DPC
186
150
0
28 Nov 2022
SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding
IEEE International Conference on Computer Vision (ICCV), 2022
Favyen Bastani
Piper Wolters
Ritwik Gupta
Joe Ferdinando
Aniruddha Kembhavi
293
169
0
28 Nov 2022
Multi-Modal Few-Shot Temporal Action Detection
Sauradip Nag
Mengmeng Xu
Xiatian Zhu
Juan-Manuel Perez-Rua
Guohao Li
Yi-Zhe Song
Tao Xiang
VLM
170
8
0
27 Nov 2022
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
International Conference on Machine Learning (ICML), 2022
Huaishao Luo
Junwei Bao
Youzheng Wu
Xiaodong He
Tianrui Li
VLM
223
190
0
27 Nov 2022
Prototype as Query for Few Shot Semantic Segmentation
Leilei Cao
Yibo Guo
Ye Yuan
Qiangguo Jin
ViT
173
16
0
27 Nov 2022
From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Britty Baby
Daksh Thapar
Mustafa Chasmai
Tamajit Banerjee
Kunal Dargan
A. Suri
Subhashis Banerjee
Chetan Arora
287
37
0
26 Nov 2022
Rethinking Alignment and Uniformity in Unsupervised Image Semantic Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Daoan Zhang
Chenming Li
Haoquan Li
Wen-Fong Huang
Lingyun Huang
Jianguo Zhang
208
21
0
26 Nov 2022
RbA: Segmenting Unknown Regions Rejected by All
IEEE International Conference on Computer Vision (ICCV), 2022
Nazir Nayal
Mısra Yavuz
João F. Henriques
Fatma Guney
UQCV
186
64
0
25 Nov 2022
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2022
Fabio Cermelli
Matthieu Cord
Arthur Douillard
CLL
VLM
208
35
0
25 Nov 2022
Aggregated Text Transformer for Scene Text Detection
Zhao Zhou
Xiangcheng Du
Yingbin Zheng
Cheng Jin
ViT
170
1
0
25 Nov 2022
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
IEEE International Conference on Robotics and Automation (ICRA), 2022
Ya Lu
Yuqiao Chen
Nicholas Ruozzi
Yu Xiang
218
28
0
21 Nov 2022
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
Jiaru Jia
Ming-Yuan Liu
Jiake Xie
Xin Chen
Hong Zhang
Xin Jiang
Aiqing Yang
154
0
0
21 Nov 2022
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference
Computer Vision and Pattern Recognition (CVPR), 2022
Haoran You
Yunyang Xiong
Xiaoliang Dai
Bichen Wu
Peizhao Zhang
Haoqi Fan
Peter Vajda
Yingyan Lin
373
43
0
18 Nov 2022
Delving into Transformer for Incremental Semantic Segmentation
Zekai Xu
Mingying Zhang
Jiayue Hou
Xing Gong
Chuan Wen
Chengjie Wang
Junge Zhang
CLL
126
2
0
18 Nov 2022
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Computer Vision and Pattern Recognition (CVPR), 2022
Hao Li
Jinguo Zhu
Xiaohu Jiang
Xizhou Zhu
Jiaming Song
...
Xiaohua Wang
Yu Qiao
Xiaogang Wang
Wenhai Wang
Jifeng Dai
MLLM
150
66
0
17 Nov 2022
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information
Computer Vision and Pattern Recognition (CVPR), 2022
Weijie Su
Xizhou Zhu
Chenxin Tao
Lewei Lu
Bin Li
Gao Huang
Yu Qiao
Xiaogang Wang
Jie Zhou
Jifeng Dai
189
54
0
17 Nov 2022
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors
Computer Vision and Pattern Recognition (CVPR), 2022
Yuang Zhang
Tiancai Wang
Xiangyu Zhang
VOT
251
207
0
17 Nov 2022
TrafficCAM: A Versatile Dataset for Traffic Flow Segmentation
Zhongying Deng
Yanqi Chen
Lihao Liu
Shujun Wang
Rihuan Ke
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
177
5
0
17 Nov 2022
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation
Jiaheng Liu
Tong He
Honghui Yang
Rui Su
Jiayi Tian
Junran Wu
Hongcheng Guo
Ke Xu
Wanli Ouyang
ISeg
127
16
0
17 Nov 2022
Robust Online Video Instance Segmentation with Track Queries
Zitong Zhan
Daniel McKee
Svetlana Lazebnik
194
11
0
16 Nov 2022
Previous
1
2
3
...
30
31
32
33
34
Next