Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.01527
Cited By
v1
v2
v3 (latest)
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,661 papers shown
Title
RMem: Restricted Memory Banks Improve Video Object Segmentation
Junbao Zhou
Ziqi Pang
Yu-Xiong Wang
VOS
350
17
0
12 Jun 2024
Dataset Enhancement with Instance-Level Augmentations
Orest Kupyn
Christian Rupprecht
230
17
0
12 Jun 2024
2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
Zhensong Xu
Jiangtao Yao
Chengjing Wu
Ting Liu
Luoqi Liu
230
1
0
12 Jun 2024
CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation
Zhongzhen Huang
Yankai Jiang
Rongzhao Zhang
Shaoting Zhang
Xiaofan Zhang
MedIm
289
12
0
11 Jun 2024
ROADWork: A Dataset and Benchmark for Learning to Recognize, Observe, Analyze and Drive Through Work Zones
Anurag Ghosh
Shen Zheng
Shen Zheng
Juan R. Alvarez-Padilla
Hailiang Zhu
Hailiang Zhu
Michael Cardei
Nicholas Dunn
Christoph Mertz
Srinivasa Narasimhan
260
7
0
11 Jun 2024
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset
International Conference on Machine Learning (ICML), 2024
Shijie Lian
Ziyi Zhang
Hua Li
Wenjie Li
Laurence Tianruo Yang
Sam Kwong
Runmin Cong
VLM
266
33
0
10 Jun 2024
Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zhiyuan Cheng
Cheng Han
James Liang
Qifan Wang
Xiangyu Zhang
Dongfang Liu
AAML
188
9
0
09 Jun 2024
F-LMM: Grounding Frozen Large Multimodal Models
Computer Vision and Pattern Recognition (CVPR), 2024
Size Wu
Sheng Jin
Wenwei Zhang
Lumin Xu
Wentao Liu
Wei Li
Chen Change Loy
MLLM
544
21
0
09 Jun 2024
ProMotion: Prototypes As Motion Learners
Computer Vision and Pattern Recognition (CVPR), 2024
Yawen Lu
Dongfang Liu
Qifan Wang
Cheng Han
Yiming Cui
Zhiwen Cao
Xueling Zhang
Yingjie Victor Chen
Heng Fan
DiffM
353
9
0
07 Jun 2024
Semantic Segmentation on VSPW Dataset through Masked Video Consistency
Chen Liang
Qiang Guo
Chongkai Yu
Chengjing Wu
Ting Liu
Luoqi Liu
212
2
0
07 Jun 2024
3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation
Feiyu Pan
Hao Fang
Xiankai Lu
173
3
0
07 Jun 2024
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Neural Information Processing Systems (NeurIPS), 2024
Yang Sui
Yanyu Li
Vidit Goel
Yerlan Idelbayev
Junli Cao
Ju Hu
Dhritiman Sagar
Bo Yuan
Sergey Tulyakov
Jian Ren
MQ
216
36
0
06 Jun 2024
Matching Anything by Segmenting Anything
Computer Vision and Pattern Recognition (CVPR), 2024
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
229
47
0
06 Jun 2024
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Ruipu Wu
Jifei Che
Han Li
Chengjing Wu
Ting Liu
Luoqi Liu
224
0
0
06 Jun 2024
Frequency-based Matcher for Long-tailed Semantic Segmentation
Shan Li
Pu Cao
Pu Cao
Liulei Li
Huadong Ma
198
3
0
06 Jun 2024
Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy
Yunho Kim
Jeong Hyun Lee
Choongin Lee
Juhyeok Mun
D. Youm
Jeongsoo Park
Jemin Hwangbo
163
8
0
05 Jun 2024
P2PFormer: A Primitive-to-polygon Method for Regular Building Contour Extraction from Remote Sensing Images
Tao Zhang
Shiqing Wei
Yikang Zhou
M. Luo
Wenling You
Shunping Ji
168
7
0
05 Jun 2024
FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping
Yuzhou Ji
He Zhu
Junshu Tang
Wuyi Liu
Zhizhong Zhang
Yuan Xie
Xin Tan
216
22
0
04 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Fahad Shahbaz Khan
VLM
ISeg
651
19
0
04 Jun 2024
Segmentation-Free Guidance for Text-to-Image Diffusion Models
K. Azarian
Debasmit Das
Qiqi Hou
Fatih Porikli
VLM
187
1
0
03 Jun 2024
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding
Thanh-Dat Truong
Utsav Prabhu
Dongyi Wang
Bhiksha Raj
Susan Gauch
J. Subbiah
Khoa Luu
313
4
0
03 Jun 2024
MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images
Ke-Lei Wang
Pin-Hsuan Chou
Young-Ching Chou
Chia-Jen Liu
Cheng-Kuan Lin
Yu-Chee Tseng
154
1
0
03 Jun 2024
On the Nonlinearity of Layer Normalization
Yunhao Ni
Yuxin Guo
Junlong Jia
Lei Huang
284
7
0
03 Jun 2024
Object Aware Egocentric Online Action Detection
Joungbin An
Yunsu Park
Hyolim Kang
Seon Joo Kim
EgoV
172
1
0
03 Jun 2024
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Yunheng Li
Zhongyu Li
Quansheng Zeng
Qibin Hou
Ming-Ming Cheng
VLM
286
18
0
02 Jun 2024
Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation
Xinyue Chen
Miaojing Shi
244
0
0
01 Jun 2024
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Biao Wu
Diankai Zhang
Sihan Gao
Cheng-yong Zheng
Shaoli Liu
Ning Wang
254
0
0
01 Jun 2024
Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Tiancheng Shen
Jun Hao Liew
Long Mai
Lu Qi
Jiashi Feng
Jiaya Jia
DiffM
159
3
0
31 May 2024
Extreme Point Supervised Instance Segmentation
Hyeonjun Lee
S. Hwang
Suha Kwak
278
7
0
31 May 2024
On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines
Selim Kuzucu
Kemal Oksuz
Jonathan Sadeghi
P. Dokania
220
8
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
269
11
0
30 May 2024
View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields
Haodi He
Jiahui Lei
Adam W. Harley
Leonidas Guibas
3DV
162
4
0
30 May 2024
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation
Niclas Vodisch
Kürsat Petek
Markus Kappeler
Abhinav Valada
Wolfram Burgard
VLM
226
8
0
29 May 2024
BAISeg: Boundary Assisted Weakly Supervised Instance Segmentation
Tengbo Wang
Yu Bai
ISeg
226
1
0
27 May 2024
Memorize What Matters: Emergent Scene Decomposition from Multitraverse
Yiming Li
Zehong Wang
Yue Wang
Zhiding Yu
Zan Gojcic
Marco Pavone
Chen Feng
Jose M. Alvarez
3DGS
150
3
0
27 May 2024
HDC: Hierarchical Semantic Decoding with Counting Assistance for Generalized Referring Expression Segmentation
Zhuoyan Luo
Yinghao Wu
Yong-Jin Liu
Yicheng Xiao
Jinqiang Cui
Yujiu Yang
254
0
0
24 May 2024
U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation
Bingyu Li
Da Zhang
Zhiyuan Zhao
Junyu Gao
Xuelong Li
260
14
0
24 May 2024
Synergistic Global-space Camera and Human Reconstruction from Videos
Yizhou Zhao
Tuanfeng Y. Wang
Bhiksha Raj
Min Xu
Jimei Yang
Chun-Hao Paul Huang
3DGS
3DH
162
8
0
23 May 2024
Efficient Robot Learning for Perception and Mapping
Niclas Vodisch
SSL
173
0
0
23 May 2024
RoGS: Large Scale Road Surface Reconstruction based on 2D Gaussian Splatting
Zhiheng Feng
Wenhua Wu
Hesheng Wang
3DGS
210
3
0
23 May 2024
Tuning-free Universally-Supervised Semantic Segmentation
IEEE Access (IEEE Access), 2024
Xiaobo Yang
Xiaojin Gong
VLM
236
3
0
23 May 2024
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Katherine Xu
Lingzhi Zhang
Jianbo Shi
382
28
0
23 May 2024
EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
Yuhang Yang
Wei Zhai
Chengfeng Wang
Chengjun Yu
Yang Cao
Zheng-jun Zha
286
16
0
22 May 2024
Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation
Dingwen Zhang
Hao Li
Diqi He
Nian Liu
Lechao Cheng
Jingdong Wang
Junwei Han
VLM
165
11
0
22 May 2024
Influence of Water Droplet Contamination for Transparency Segmentation
Volker Knauthe
Paul Weitz
Thomas Pollabauer
Tristan Wirth
Arne Rak
Arjan Kuijper
Dieter W. Fellner
293
1
0
21 May 2024
DLAFormer: An End-to-End Transformer For Document Layout Analysis
Jiawei Wang
Kai Hu
Qiang Huo
3DV
ViT
181
10
0
20 May 2024
Unifying 3D Vision-Language Understanding via Promptable Queries
European Conference on Computer Vision (ECCV), 2024
Ziyu Zhu
Zhuofan Zhang
Xiaojian Ma
Xuesong Niu
Yixin Chen
Baoxiong Jia
Zhidong Deng
Siyuan Huang
Qing Li
306
62
0
19 May 2024
HARIS: Human-Like Attention for Reference Image Segmentation
IEEE International Conference on Multimedia and Expo (ICME), 2024
Mengxi Zhang
Heqing Lian
Yiming Liu
Jie Chen
VLM
244
0
0
17 May 2024
NeRO: Neural Road Surface Reconstruction
Ruibo Wang
Song Zhang
Ping Huang
Donghai Zhang
Haoyu Chen
3DV
159
2
0
17 May 2024
Grounded 3D-LLM with Referent Tokens
Yilun Chen
Shuai Yang
Haifeng Huang
Tai Wang
Ruiyuan Lyu
Runsen Xu
Dahua Lin
Jiangmiao Pang
304
75
0
16 May 2024
Previous
1
2
3
...
15
16
17
...
32
33
34
Next