Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.01527
Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,359 papers shown
Title
Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
Anton Voronov
Denis Kuznedelev
Mikhail Khoroshikh
Valentin Khrulkov
Dmitry Baranchuk
106
2
0
02 Dec 2024
Referring Video Object Segmentation via Language-aligned Track Selection
Seongchan Kim
Woojeong Jin
Sangbeom Lim
Heeji Yoon
Hyunwook Choi
Seungryong Kim
VOS
89
0
0
02 Dec 2024
SyncVIS: Synchronized Video Instance Segmentation
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
67
0
0
01 Dec 2024
EDTformer: An Efficient Decoder Transformer for Visual Place Recognition
Tong Jin
Feng Lu
Shuyu Hu
Chun Yuan
Yunpeng Liu
ViT
72
0
0
01 Dec 2024
ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model
Kunyang Han
Yibo Hu
Mengxue Qu
Hailin Shi
Yao Zhao
Y. X. Wei
MLLM
VLM
3DV
83
1
0
29 Nov 2024
Track Anything Behind Everything: Zero-Shot Amodal Video Object Segmentation
Finlay G. C. Hudson
W. Smith
VOS
VLM
71
0
0
28 Nov 2024
On Moving Object Segmentation from Monocular Video with Transformers
Christian Homeyer
Christoph Schnörr
92
3
0
28 Nov 2024
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior
Li-Yuan Tsao
Hao-Wei Chen
Hao-Wei Chung
Deqing Sun
Chun-Yi Lee
Kelvin Chan
Ming Yang
DiffM
76
3
0
27 Nov 2024
HyperSeg: Towards Universal Visual Segmentation with Large Language Model
Cong Wei
Yujie Zhong
Haoxian Tan
Y. Liu
Zheng Zhao
Jie Hu
Yujiu Yang
VOS
MLLM
VLM
LRM
84
1
0
26 Nov 2024
Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps
Xue Xia
Randall Balestriero
Tao Zhang
L. Hurni
VOS
AI4TS
65
0
0
26 Nov 2024
SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models
Harsh Goel
Sai Shankar Narasimhan
Oguzhan Akcin
Sandeep P. Chinchali
DiffM
87
2
0
25 Nov 2024
VideoOrion: Tokenizing Object Dynamics in Videos
Yicheng Feng
Yijiang Li
Wanpeng Zhang
Sipeng Zheng
Zongqing Lu
Sipeng Zheng
Zongqing Lu
104
1
0
25 Nov 2024
OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs
Chen Xin
T. Motz
Andreas Hartel
Enkelejda Kasneci
69
0
0
23 Nov 2024
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
Miguel Espinosa
Chenhongyi Yang
Linus Ericsson
Steven G. McDonagh
Elliot J. Crowley
VLM
68
0
0
22 Nov 2024
DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines
Mizanur Rahman Jewel
Mohamed Elmahallawy
S. Madria
Samuel Frimpong
73
1
0
20 Nov 2024
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
Ziyi Wang
Y. Wang
Xumin Yu
Jie Zhou
Jiwen Lu
72
0
0
20 Nov 2024
Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images
Xuechao Zou
Shun Zhang
Kai Li
Shiying Wang
Junliang Xing
Lei Jin
Congyan Lang
Pin Tao
61
1
0
20 Nov 2024
Chanel-Orderer: A Channel-Ordering Predictor for Tri-Channel Natural Images
Shen Li
Lei Jiang
Wei Wang
Hongwei Hu
Liang Li
67
0
0
20 Nov 2024
MGNiceNet: Unified Monocular Geometric Scene Understanding
Markus Schön
Michael Buchholz
Klaus C. J. Dietmayer
3DPC
77
0
0
18 Nov 2024
MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection Data
Chika Maduabuchi
Ericmoore Jossou
Matteo Bucci
28
0
0
12 Nov 2024
Watermark Anything with Localized Messages
Tom Sander
Pierre Fernandez
Alain Durmus
Teddy Furon
Matthijs Douze
VLM
34
7
0
11 Nov 2024
MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps
Xue Xia
Daiwei Zhang
Wenxuan Song
Wei Huang
L. Hurni
AI4TS
VLM
28
0
0
11 Nov 2024
Moving Off-the-Grid: Scene-Grounded Video Representations
Sjoerd van Steenkiste
Daniel Zoran
Yi Yang
Yulia Rubanova
Rishabh Kabra
...
Thomas Keck
João Carreira
Alexey Dosovitskiy
Mehdi S. M. Sajjadi
Thomas Kipf
26
3
0
08 Nov 2024
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
Zhitong Gao
Bingnan Li
Mathieu Salzmann
Xuming He
OOD
VLM
52
1
0
06 Nov 2024
Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective
Qishuai Wen
Chun-Guang Li
ViT
32
0
0
05 Nov 2024
CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Jinchao Ge
Bowen Zhang
Akide Liu
Minh Hieu Phan
Qi Chen
Yangyang Shu
Yang Zhao
VLM
CLL
27
0
0
05 Nov 2024
GenXD: Generating Any 3D and 4D Scenes
Yuyang Zhao
Chung-Ching Lin
Kevin Qinghong Lin
Zhiwen Yan
Linjie Li
Z. Yang
Jianfeng Wang
G. Lee
Lijuan Wang
VGen
43
14
0
04 Nov 2024
Event-guided Low-light Video Semantic Segmentation
Zhen Yao
Mooi Choo Choo Chuah
50
6
0
01 Nov 2024
ZIM: Zero-Shot Image Matting for Anything
Beomyoung Kim
Chanyong Shin
Joonhyun Jeong
Hyungsik Jung
Se Yun Lee
Sewhan Chun
Dong-Hyun Hwang
Joonsang Yu
VLM
34
2
0
01 Nov 2024
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing
Naufal Suryanto
Andro Aprila Adiputra
Ahmada Yusril Kadiptya
Thi-Thu-Huong Le
Derry Pratama
Yongsu Kim
Howon Kim
DiffM
42
0
0
01 Nov 2024
OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction
Hongbo Zhao
Lue Fan
Yuntao Chen
Haochen Wang
Y. Yang
Xiaojuan Jin
Yixin Zhang
Gaofeng Meng
Zhaoxiang Zhang
44
1
0
30 Oct 2024
Unlocking Comics: The AI4VA Dataset for Visual Understanding
Peter Grönquist
Deblina Bhattacharjee
Bahar Aydemir
Baran Ozaydin
Tong Zhang
Mathieu Salzmann
Sabine Süsstrunk
21
0
0
27 Oct 2024
On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
Rajat Modi
Vibhav Vineet
Y. S. Rawat
33
1
0
25 Oct 2024
SegLLM: Multi-round Reasoning Segmentation
XuDong Wang
Shaolun Zhang
Shufan Li
Konstantinos Kallidromitis
Kehan Li
Yusuke Kato
Kazuki Kozuka
Trevor Darrell
VLM
LRM
50
1
0
24 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context
Maximilian Augustin
Syed Shakib Sarwar
Mostafa Elhoushi
Sai Qian Zhang
Yuecheng Li
B. D. Salvo
23
0
0
23 Oct 2024
Is Smoothness the Key to Robustness? A Comparison of Attention and Convolution Models Using a Novel Metric
Baiyuan Chen
MLT
18
0
0
23 Oct 2024
PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting
Yu Wang
Xiaobao Wei
Ming Lu
Guoliang Kang
3DGS
26
5
0
23 Oct 2024
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model
Zhixiong Nan
Xianghong Li
Tao Xiang
Jifeng Dai
ISeg
38
0
0
22 Oct 2024
Frontiers in Intelligent Colonoscopy
Ge-Peng Ji
Jingyi Liu
Peng-Tao Xu
Nick Barnes
F. Khan
Salman Khan
Deng-Ping Fan
41
4
0
22 Oct 2024
Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation
Ruting Chi
Zhiyi Huang
Yuexing Han
ISeg
23
0
0
21 Oct 2024
Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment
Yankai Jiang
Wenhui Lei
Xiaofan Zhang
S. Zhang
MedIm
32
2
0
21 Oct 2024
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Ziming Huang
Xurui Li
Haotian Liu
Feng Xue
Yuzhe Wang
Yu Zhou
30
0
0
18 Oct 2024
DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering
Jiahao Lu
Jiacheng Deng
Ruijie Zhu
Yanzhe Liang
Wenfei Yang
Tianzhu Zhang
Xu Zhou
3DGS
33
5
0
17 Oct 2024
GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
Shrishti Saha Shetu
Emanuël A. P. Habets
Andreas Brendel
26
1
0
17 Oct 2024
Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Changcheng Xiao
Qiong Cao
Yujie Zhong
Xiang Zhang
Tao Wang
Canqun Yang
L. Lan
23
0
0
17 Oct 2024
Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation
Wenbo Xu
Yanan Wu
Haoran Jiang
Yang Wang
Qiang Wu
Jian Andrew Zhang
CLL
VLM
21
0
0
16 Oct 2024
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Yiwei Guo
Shaobin Zhuang
Kunchang Li
Yu Qiao
Yali Wang
VLM
CLIP
24
0
0
16 Oct 2024
Order-aware Interactive Segmentation
Bin Wang
Anwesa Choudhuri
Meng Zheng
Zhongpai Gao
Benjamin Planche
Andong Deng
Qin Liu
Terrence Chen
Ulas Bagci
Ziyan Wu
VLM
100
1
0
16 Oct 2024
Overcoming Domain Limitations in Open-vocabulary Segmentation
Dongjun Hwang
Seong Joon Oh
Junsuk Choe
SSeg
OOD
52
0
0
15 Oct 2024
AutoTurb: Using Large Language Models for Automatic Algebraic Model Discovery of Turbulence Closure
Yu Zhang
Kefeng Zheng
Fei Liu
Qingfu Zhang
Zhenkun Wang
29
2
0
14 Oct 2024
Previous
1
2
3
4
5
6
...
26
27
28
Next