2112.01527
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,359 papers shown
Title
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
28
40
0
07 Apr 2023
SegGPT: Segmenting Everything In Context
Xinlong Wang
Xiaosong Zhang
Yue Cao
Wen Wang
Chunhua Shen
Tiejun Huang
VOS
MLLM
VLM
30
199
0
06 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
42
11
0
06 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
20
6,796
0
05 Apr 2023
Uncertainty estimation in Deep Learning for Panoptic segmentation
Michael J. Smith
F. Ferrie
OOD
UQCV
28
0
0
04 Apr 2023
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network
Cong Han
Yujie Zhong
Dengjie Li
Kai Han
Lin Ma
VLM
SSeg
6
30
0
03 Apr 2023
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
Jihan Yang
Runyu Ding
Weipeng Deng
Zhe Wang
Xiaojuan Qi
20
61
0
03 Apr 2023
Devil is in the Queries: Advancing Mask Transformers for Real-world Medical Image Segmentation and Out-of-Distribution Localization
Mingze Yuan
Yingda Xia
Hexin Dong
Zi Chen
Jiawen Yao
...
Bin Dong
Jing Zhou
Le Lu
Ling Zhang
Li Zhang
OOD
MedIm
16
20
0
01 Apr 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
21
46
0
30 Mar 2023
MobileInst: Video Instance Segmentation on the Mobile
Renhong Zhang
Tianheng Cheng
Shusheng Yang
Hao Jiang
Shuai Zhang
...
Xin Li
Xiaowen Ying
Dashan Gao
Wenyu Liu
Xinggang Wang
31
7
0
30 Mar 2023
DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji
Zhe Chen
Enze Xie
Lanqing Hong
Xihui Liu
Zhaoqiang Liu
Tong Lu
Zhenguo Li
Ping Luo
DiffM
VLM
41
130
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Vidit Goel
E. Peruzzo
Yifan Jiang
Dejia Xu
Xingqian Xu
N. Sebe
Trevor Darrell
Zhangyang Wang
Humphrey Shi
DiffM
22
6
0
30 Mar 2023
Complementary Random Masking for RGB-Thermal Semantic Segmentation
Ukcheol Shin
Kyunghyun Lee
In So Kweon
Jean Oh
22
20
0
30 Mar 2023
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
Jie Qin
Jie Wu
Pengxiang Yan
Ming Li
Ren Yuxi
...
Yitong Wang
Rui Wang
Shilei Wen
X. Pan
Xingang Wang
SSeg
VLM
16
88
0
30 Mar 2023
Masked and Adaptive Transformer for Exemplar Based Image Translation
Changlong Jiang
Fei Gao
Biao Ma
Yuhao Lin
N. Wang
Gang Xu
22
18
0
30 Mar 2023
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval
Finlay G. C. Hudson
W. Smith
ViT
38
1
0
30 Mar 2023
Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video
Wenzheng Zeng
Yang Xiao
Sicheng Wei
Jinfang Gan
Xintao Zhang
Z. Cao
Zhiwen Fang
Joey Tianyi Zhou
CVBM
13
11
0
28 Mar 2023
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation
Zijian Zhou
Miaojing Shi
Holger Caesar
27
18
0
28 Mar 2023
Mask-Free Video Instance Segmentation
Lei Ke
Martin Danelljan
Henghui Ding
Yu-Wing Tai
Chi-Keung Tang
F. I. F. Richard Yu
24
22
0
28 Mar 2023
OpenInst: A Simple Query-Based Method for Open-World Instance Segmentation
Cheng Wang
Guoli Wang
Qian Zhang
Pengning Guo
Wenyu Liu
Xinggang Wang
ISeg
VLM
19
7
0
28 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming Yang
F. Khan
ViT
40
84
0
27 Mar 2023
You Only Segment Once: Towards Real-Time Panoptic Segmentation
Jie Hu
Linyan Huang
Tianhe Ren
Shengchuan Zhang
Rongrong Ji
Liujuan Cao
SSeg
44
54
0
26 Mar 2023
Affordance Grounding from Demonstration Video to Target Image
Joya Chen
Difei Gao
Kevin Qinghong Lin
Mike Zheng Shou
19
24
0
26 Mar 2023
BoxVIS: Video Instance Segmentation with Box Annotations
Minghan Li
Lei Zhang
ISeg
VOS
30
1
0
26 Mar 2023
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos
Minghan Li
Shuai Li
Wangmeng Xiang
Lei Zhang
28
9
0
25 Mar 2023
OPDMulti: Openable Part Detection for Multiple Objects
Xiaohao Sun
Hanxiao Jiang
Manolis Savva
Angel X. Chang
AI4CE
25
15
0
24 Mar 2023
Category Query Learning for Human-Object Interaction Classification
Chi Xie
Fangao Zeng
Yue Hu
Shuang Liang
Yichen Wei
VLM
24
20
0
24 Mar 2023
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
WonJun Moon
Sangeek Hyun
S. Park
Dongchan Park
Jae-Pil Heo
ViT
41
106
0
24 Mar 2023
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning
Zhenyu Xie
Zaiyu Huang
Xin Dong
Fuwei Zhao
Haoye Dong
Xijin Zhang
Feida Zhu
Xiaodan Liang
3DH
21
91
0
24 Mar 2023
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers
Cong Wei
Brendan Duke
R. Jiang
P. Aarabi
Graham W. Taylor
Florian Shkurti
ViT
46
14
0
24 Mar 2023
Position-Guided Point Cloud Panoptic Segmentation Transformer
Zeqi Xiao
Wenwei Zhang
Tai Wang
Chen Change Loy
Dahua Lin
Jiangmiao Pang
ViT
3DPC
21
12
0
23 Mar 2023
Zero-guidance Segmentation Using Zero Segment Labels
Pitchaporn Rewatbowornwong
Nattanat Chatthee
E. Chuangsuwanich
Supasorn Suwajanakorn
VLM
25
11
0
23 Mar 2023
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation
Xiangtai Li
Haobo Yuan
Wenwei Zhang
Guangliang Cheng
Jiangmiao Pang
Chen Change Loy
ViT
VOS
38
20
0
22 Mar 2023
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Weijia Wu
Yuzhong Zhao
Mike Zheng Shou
Hong Zhou
Chunhua Shen
31
140
0
21 Mar 2023
BoxSnake: Polygonal Instance Segmentation with Box Supervision
Rui Yang
Lin Song
Yixiao Ge
Xiu Li
ISeg
19
18
0
21 Mar 2023
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images
Ruiqi Wang
A. Patil
Fenggen Yu
Hao Zhang
13
1
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
ViT
CLIP
38
259
0
20 Mar 2023
Open-vocabulary Panoptic Segmentation with Embedding Modulation
Xi Chen
Shuang Li
Ser-Nam Lim
Antonio Torralba
Hengshuang Zhao
VLM
19
30
0
20 Mar 2023
Generative Semantic Segmentation
Jia-Qing Chen
Jiachen Lu
Xiatian Zhu
Li Zhang
GAN
ISeg
VLM
30
38
0
20 Mar 2023
Reliability in Semantic Segmentation: Are We on the Right Track?
Pau de Jorge
Riccardo Volpi
Philip H. S. Torr
Grégory Rogez
UQCV
24
19
0
20 Mar 2023
Neural Refinement for Absolute Pose Regression with Feature Synthesis
Shuai Chen
Yash Bhalgat
Xinghui Li
Jiawang Bian
Kejie Li
Zirui Wang
V. Prisacariu
26
18
0
17 Mar 2023
LERF: Language Embedded Radiance Fields
J. Kerr
C. Kim
Ken Goldberg
Angjoo Kanazawa
Matthew Tancik
15
349
0
16 Mar 2023
MATIS: Masked-Attention Transformers for Surgical Instrument Segmentation
Nicolás Ayobi
Alejandra Pérez-Rondón
Santiago Rodríguez
Pablo Arbelaez
MedIm
43
18
0
16 Mar 2023
Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers
Zhibo Yang
Sounak Mondal
Seoyoung Ahn
Ruoyu Xue
G. Zelinsky
Minh Hoai
Dimitris Samaras
11
10
0
16 Mar 2023
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation
Kunyang Han
Yong-Jin Liu
Jun Hao Liew
Henghui Ding
Yunchao Wei
...
Yitong Wang
Yansong Tang
Yujiu Yang
Jiashi Feng
Yao-Min Zhao
VLM
34
35
0
16 Mar 2023
RSFNet: A White-Box Image Retouching Approach using Region-Specific Color Filters
Wenqi Ouyang
Yi Dong
Xiaoyang Kang
Peiran Ren
Xin Xu
Xuansong Xie
20
7
0
15 Mar 2023
High-level Feature Guided Decoding for Semantic Segmentation
Ye Huang
Di Kang
Shenghua Gao
Wen Li
Lixin Duan
18
0
0
15 Mar 2023
FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation
Junjie He
Pengyu Li
Yifeng Geng
Xuansong Xie
ISeg
VLM
30
44
0
15 Mar 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
Hao Zhang
Feng Li
Xueyan Zou
Siyi Liu
Chun-yue Li
Jianfeng Gao
Jianwei Yang
Lei Zhang
ObjD
VLM
22
150
0
14 Mar 2023
LoG-CAN: local-global Class-aware Network for semantic segmentation of remote sensing images
Xiaowen Ma
Mengting Ma
Chenlu Hu
Zhiyuan Song
Zi-Shu Zhao
Tian Feng
Wei Zhang
38
12
0
14 Mar 2023