2112.01527
Cited By
v1
v2
v3 (latest)
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,661 papers shown
PCT: Perspective Cue Training Framework for Multi-Camera BEV Segmentation
Haruya Ishikawa
Takumi Iida
Yoshinori Konishi
Yoshimitsu Aoki
168
4
0
19 Mar 2024
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Wenqi Zhu
Jiale Cao
Jin Xie
Shuangming Yang
Yanwei Pang
VLM
CLIP
254
10
0
19 Mar 2024
Fusion Transformer with Object Mask Guidance for Image Forgery Analysis
Dimitrios Karageorgiou
Giorgos Kordopatis-Zilos
Symeon Papadopoulos
ViT
165
11
0
18 Mar 2024
Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery
Computer Vision and Pattern Recognition (CVPR), 2024
Yuqi Zhang
Guanying Chen
Jiaxing Chen
Shuguang Cui
148
5
0
18 Mar 2024
EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding
European Conference on Computer Vision (ECCV), 2024
Wenhua Wu
Qi Wang
Guangming Wang
Junping Wang
Tiankun Zhao
Yang Liu
Dongchao Gao
Yanfeng Guo
Hesheng Wang
AI4CE
3DV
178
16
0
18 Mar 2024
Video Object Segmentation with Dynamic Query Modulation
IEEE International Conference on Multimedia and Expo (ICME), 2024
Hantao Zhou
Runze Hu
Xiu Li
VOS
159
3
0
18 Mar 2024
PosSAM: Panoptic Open-vocabulary Segment Anything
VS Vibashan
Shubhankar Borse
Hyojin Park
Debasmit Das
Vishal M. Patel
Munawar Hayat
Fatih Porikli
VLM
MLLM
165
8
0
14 Mar 2024
Renovating Names in Open-Vocabulary Segmentation Benchmarks
Neural Information Processing Systems (NeurIPS), 2024
Haiwen Huang
Songyou Peng
Dan Zhang
Andreas Geiger
VLM
194
5
0
14 Mar 2024
The NeRFect Match: Exploring NeRF Features for Visual Localization
European Conference on Computer Vision (ECCV), 2024
Qunjie Zhou
Maxim Maximov
Or Litany
Laura Leal-Taixé
205
26
0
14 Mar 2024
WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity
Qiyuan Wang
Y. Liu
Shang Zhao
Rong Liu
S. Kevin Zhou
210
1
0
14 Mar 2024
Faceptor: A Generalist Model for Face Perception
European Conference on Computer Vision (ECCV), 2024
Lixiong Qin
Mei Wang
Xuannan Liu
Yuhang Zhang
Weihong Deng
Xiaoshuai Song
Weiran Xu
Weihong Deng
CVBM
192
15
0
14 Mar 2024
RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes
European Conference on Computer Vision (ECCV), 2024
Thang-Anh-Quan Nguyen
Luis Roldão
Nathan Piasco
Moussâb Bennehar
D. Tsishkou
355
13
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language Interface
European Conference on Computer Vision (ECCV), 2024
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Jiaming Song
Bernt Schiele
Liwei Wang
VLM
254
22
0
14 Mar 2024
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
European Conference on Computer Vision (ECCV), 2024
Yizhe Xiong
Hui Chen
Tianxiang Hao
Zijia Lin
Jungong Han
Yuesong Zhang
Guoxin Wang
Yongjun Bao
Guiguang Ding
242
24
0
14 Mar 2024
When Semantic Segmentation Meets Frequency Aliasing
International Conference on Learning Representations (ICLR), 2024
Linwei Chen
Lin Gu
Ying Fu
330
15
0
14 Mar 2024
HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction
Computer Vision and Pattern Recognition (CVPR), 2024
Yi Zhou
Hui Zhang
Jiaqian Yu
Yifan Yang
Sangil Jung
Seungsang Park
ByungIn Yoo
3DPC
223
47
0
13 Mar 2024
Language-Grounded Dynamic Scene Graphs for Interactive Object Search with Mobile Manipulation
IEEE Robotics and Automation Letters (RA-L), 2024
Daniel Honerkamp
Martin Buchner
Fabien Despinoy
Tim Welschehold
Abhinav Valada
LM&Ro
256
75
0
13 Mar 2024
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Zicheng Zhang
Tong Zhang
Yi Zhu
Jian-zhuo Liu
Xiaodan Liang
QiXiang Ye
Wei Ke
VLM
205
7
0
13 Mar 2024
TINA: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation
IEEE International Conference on Multimedia and Expo (ICME), 2024
Dingbang Li
Wenzhou Chen
Xin Lin
LLMAG
LM&Ro
159
11
0
13 Mar 2024
MoAI: Mixture of All Intelligence for Large Language and Vision Models
European Conference on Computer Vision (ECCV), 2024
Byung-Kwan Lee
Beomchan Park
Chae Won Kim
Yonghyun Ro
MLLM
VLM
355
31
0
12 Mar 2024
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions
Computer Vision and Pattern Recognition (CVPR), 2024
Chunlong Xia
Xinliang Wang
Feng Lv
Xin Hao
Yifeng Shi
ViT
338
118
0
12 Mar 2024
Chart4Blind: An Intelligent Interface for Chart Accessibility Conversion
International Conference on Intelligent User Interfaces (IUI), 2024
Omar Moured
Morris Baumgarten-Egemole
Alina Roitberg
Karin Muller
Thorsten Schwarz
Rainer Stiefelhagen
192
16
0
11 Mar 2024
Query-guided Prototype Evolution Network for Few-Shot Segmentation
IEEE Transactions on Multimedia (IEEE TMM), 2024
Runmin Cong
Hang Xiong
Jinpeng Chen
Wei Zhang
Qingming Huang
Yao Zhao
182
29
0
11 Mar 2024
Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning
Computer Vision and Pattern Recognition (CVPR), 2024
Woojin Ahn
G. Yang
H. Choi
M. Lim
113
29
0
10 Mar 2024
FrameQuant: Flexible Low-Bit Quantization for Transformers
International Conference on Machine Learning (ICML), 2024
Harshavardhan Adepu
Zhanpeng Zeng
Li Zhang
Vikas Singh
MQ
151
13
0
10 Mar 2024
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Linwei Chen
Lin Gu
Ying Fu
643
71
0
08 Mar 2024
InstructGIE: Towards Generalizable Image Editing
European Conference on Computer Vision (ECCV), 2024
Zichong Meng
Changdi Yang
Jun Liu
Hao Tang
Pu Zhao
Yanzhi Wang
DiffM
180
13
0
08 Mar 2024
ComFe: An Interpretable Head for Vision Transformers
Evelyn J. Mannix
H. Bondell
Howard Bondell
VLM
ViT
455
0
0
07 Mar 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition
Yizheng Gong
Siyue Yu
Xiaoyang Wang
Jimin Xiao
CLL
307
14
0
06 Mar 2024
DINOv2 based Self Supervised Learning For Few Shot Medical Image Segmentation
Lev Ayzenberg
Raja Giryes
H. Greenspan
174
9
0
05 Mar 2024
Deep Common Feature Mining for Efficient Video Semantic Segmentation
Yaoyan Zheng
Hongyu Yang
Di Huang
169
2
0
05 Mar 2024
Benchmarking Segmentation Models with Mask-Preserved Attribute Editing
Zijin Yin
Kongming Liang
Bing Li
Zhanyu Ma
Jun Guo
VLM
283
7
0
02 Mar 2024
A citizen science toolkit to collect human perceptions of urban environments using open street view images
Matthew Danish
SM Labib
Britta Ricker
Marco Helbich
152
19
0
29 Feb 2024
PEM: Prototype-based Efficient MaskFormer for Image Segmentation
Niccolò Cavagnero
Gabriele Rosi
Claudia Cuttano
Francesca Pistilli
Marco Ciccone
Giuseppe Averta
Fabio Cermelli
294
35
0
29 Feb 2024
Feature boosting with efficient attention for scene parsing
Vivek Singh
Shailza Sharma
Fabio Cuzzolin
110
0
0
29 Feb 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
212
29
0
28 Feb 2024
Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling
David S. W. Williams
Matthew Gadd
Paul Newman
Daniele De Martini
UQCV
107
1
0
27 Feb 2024
An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary
Guodong Sun
Yuting Peng
Lei Cheng
Mengya Xu
An-Chi Wang
Bo Wu
Hongliang Ren
Yang Zhang
140
3
0
27 Feb 2024
A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track
Zehui Chen
Qiuchen Wang
Zhenyu Li
Jiaming Liu
Shanghang Zhang
Feng Zhao
126
1
0
27 Feb 2024
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation
Yichi Zhang
Ziqiao Ma
Xiaofeng Gao
Suhaila Shakiah
Qiaozi Gao
Joyce Chai
MLLM
VLM
327
74
0
26 Feb 2024
ConSept: Continual Semantic Segmentation via Adapter-based Vision Transformer
Bowen Dong
Guanglei Yang
W. Zuo
Lei Zhang
175
3
0
26 Feb 2024
Placing Objects in Context via Inpainting for Out-of-distribution Segmentation
Pau de Jorge
Riccardo Volpi
P. Dokania
Juil Sock
Grégory Rogez
DiffM
356
9
0
26 Feb 2024
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Yiting Wang
Haonan Zhao
Daniel Gummadi
M. Dianati
Kurt Debattista
Valentina Donzella
226
4
0
23 Feb 2024
Outlier detection by ensembling uncertainty with negative objectness
Anja Delić
Matej Grcić
Sinisa Segvic
UQCV
296
26
0
23 Feb 2024
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition
Lianghui Zhu
Junwei Zhou
Yan Liu
Xin Hao
Wenyu Liu
Xinggang Wang
VLM
230
17
0
22 Feb 2024
Subobject-level Image Tokenization
Delong Chen
Samuel Cahyawijaya
Jianfeng Liu
Baoyuan Wang
Pascale Fung
VLM
OCL
593
19
0
22 Feb 2024
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation
Jialei Chen
Daisuke Deguchi
Chenkai Zhang
Hiroshi Murase
VLM
212
1
0
21 Feb 2024
Cell Graph Transformer for Nuclei Classification
Wei Lou
Guanbin Li
Xiang Wan
Haofeng Li
ViT
MedIm
257
11
0
20 Feb 2024
Object-level Geometric Structure Preserving for Natural Image Stitching
Wenxiao Cai
Wankou Yang
204
10
0
20 Feb 2024
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey
Fabio Tosi
Youming Zhang
Ziren Gong
Erik Sandström
S. Mattoccia
Martin R. Oswald
Matteo Poggi
3DGS
485
93
0
20 Feb 2024